08.02.2013 Views

New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...

New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...

New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.7. IDENTIFYING POTENTIAL FEATURES 51<br />

Figure 3.7.16: This shows identified masterpeaks <strong>of</strong> two groups (top: men; bottom:<br />

women). The alignment process assigns pairs <strong>of</strong> masterpeaks having similar properties<br />

(such as m/z value). These assignments are indicated by <strong>the</strong> red arrows.<br />

Algorithm<br />

From <strong>the</strong> preprocessing steps we have acquired a list <strong>of</strong> masterpeaks <strong>of</strong> groups<br />

G1, G2 (e.g. cancer group vs. healthy group). To obtain a set <strong>of</strong> masterpeaks<br />

that differ in some defined property (e.g. average height) we per<strong>for</strong>m <strong>the</strong><br />

following key steps:<br />

i) Alignment <strong>of</strong> Masterpeaks in G1, G2 (masterpeak alignment across groups)<br />

ii) Calculation <strong>of</strong> aligned Masterpeak Pair (MPP) property Jensen-Shannon<br />

(JS) differences<br />

iii) Order this list by distances.<br />

This yields a list <strong>of</strong> pairs <strong>of</strong> aligned masterpeaks <strong>of</strong> G1 and G2, ordered by<br />

<strong>the</strong>ir respective distances.<br />

Pseudocode<br />

The basic process is as follows:<br />

Algorithm 4 Extracting features<br />

Require: Lists <strong>of</strong> masterpeaks <strong>of</strong> group 1 & 2, respectively.<br />

{Masterpeak Alignment}<br />

Match <strong>the</strong> masterpeaks <strong>of</strong> group 1 & 2 by algos described in section 3.7.1.<br />

{Calculation <strong>of</strong> Masterpeak Property Differences}<br />

return A sorted list containing tuples <strong>of</strong> aligned pairs <strong>of</strong> masterpeaks with<br />

<strong>the</strong>ir respective distances.<br />

Preprocessing 1: Masterpeak Alignment (Across Groups)<br />

The majority <strong>of</strong> masterpeaks obtained by <strong>the</strong> preprocessing steps occurs in<br />

each group <strong>of</strong> spectra (e.g. S1, S2) at <strong>the</strong> same position and having identical<br />

heights, due to <strong>the</strong> almost identical blood proteome <strong>of</strong> two humans. Remember

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!