08.02.2013 Views

New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...

New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...

New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 8<br />

Conclusion and Future<br />

Directions<br />

8.1 Conclusion<br />

In this <strong>the</strong>sis we have described a new pipeline <strong>for</strong> preprocessing, processing<br />

and analysis <strong>of</strong> Time-Of-Flight <strong>Mass</strong> Spectrometry proteomics data. We have<br />

shown <strong>the</strong> application <strong>of</strong> this web-based framework in <strong>the</strong> area <strong>of</strong> clinical<br />

diagnostics to detect molecular patterns (fingerprints) <strong>of</strong> diseases and used<br />

<strong>the</strong>se to classify unknown datasets (spectra). <strong>New</strong>ly developed algorithms<br />

allow <strong>the</strong> detection <strong>of</strong> very small signals that presumably represent very low<br />

abundant molecules, such as hormones. Fingerprints including <strong>the</strong>se very<br />

small signals can be much more sensitive compared to standard approaches<br />

that do not detect <strong>the</strong>m.<br />

The datasets are typically very large (several Gigabytes per patient). To<br />

enable handling <strong>of</strong> <strong>the</strong>se large datasets we have also introduced a new distributed<br />

computing plat<strong>for</strong>m that allows creation <strong>of</strong> heterogeneous ad-hoc<br />

Grids. To demonstrate <strong>the</strong> universality <strong>of</strong> this framework we have shown in<br />

a case study <strong>the</strong> integration <strong>of</strong> a Playstation 3 to per<strong>for</strong>m data preprocessing<br />

tasks.<br />

In <strong>the</strong> current stage <strong>of</strong> this work it is possible to reliably detect multicomponent<br />

fingerprints <strong>for</strong> a given disease. The components <strong>of</strong> a fingerprint<br />

represent particular molecular species (peptides or proteins) contained in some<br />

body fluid (e.g. blood serum) that significantly differentiate in concentration<br />

between two patient groups, such as healthy vs. diseased. These molecules -<br />

once identified - can be used as so called biomarkers. A biomarker is defined<br />

as a molecular, biological or physical characteristic (e.g. DNA sequences,<br />

antibodies or - as in our case - proteins) that indicates a specific physiologic<br />

state which can be directly linked to <strong>the</strong> clinical manifestations and outcome<br />

<strong>of</strong> a particular disease.<br />

After identification <strong>of</strong> a disease specific set <strong>of</strong> biomarkers subsequent studies<br />

can be designed to eventually find drugs that can target <strong>the</strong>se biomarkers<br />

and hopefully open up new steps <strong>for</strong> actually curing this disease.<br />

However, until this vision becomes reality <strong>the</strong>re are some open problems<br />

that need fur<strong>the</strong>r investigation to allow <strong>for</strong> a deeper understanding and thus reliable<br />

and comprehensible mass-spectrometry based clinical diagnostics. These<br />

open problems include:<br />

169

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!