New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Chapter 8<br />
Conclusion and Future<br />
Directions<br />
8.1 Conclusion<br />
In this <strong>the</strong>sis we have described a new pipeline <strong>for</strong> preprocessing, processing<br />
and analysis <strong>of</strong> Time-Of-Flight <strong>Mass</strong> Spectrometry proteomics data. We have<br />
shown <strong>the</strong> application <strong>of</strong> this web-based framework in <strong>the</strong> area <strong>of</strong> clinical<br />
diagnostics to detect molecular patterns (fingerprints) <strong>of</strong> diseases and used<br />
<strong>the</strong>se to classify unknown datasets (spectra). <strong>New</strong>ly developed algorithms<br />
allow <strong>the</strong> detection <strong>of</strong> very small signals that presumably represent very low<br />
abundant molecules, such as hormones. Fingerprints including <strong>the</strong>se very<br />
small signals can be much more sensitive compared to standard approaches<br />
that do not detect <strong>the</strong>m.<br />
The datasets are typically very large (several Gigabytes per patient). To<br />
enable handling <strong>of</strong> <strong>the</strong>se large datasets we have also introduced a new distributed<br />
computing plat<strong>for</strong>m that allows creation <strong>of</strong> heterogeneous ad-hoc<br />
Grids. To demonstrate <strong>the</strong> universality <strong>of</strong> this framework we have shown in<br />
a case study <strong>the</strong> integration <strong>of</strong> a Playstation 3 to per<strong>for</strong>m data preprocessing<br />
tasks.<br />
In <strong>the</strong> current stage <strong>of</strong> this work it is possible to reliably detect multicomponent<br />
fingerprints <strong>for</strong> a given disease. The components <strong>of</strong> a fingerprint<br />
represent particular molecular species (peptides or proteins) contained in some<br />
body fluid (e.g. blood serum) that significantly differentiate in concentration<br />
between two patient groups, such as healthy vs. diseased. These molecules -<br />
once identified - can be used as so called biomarkers. A biomarker is defined<br />
as a molecular, biological or physical characteristic (e.g. DNA sequences,<br />
antibodies or - as in our case - proteins) that indicates a specific physiologic<br />
state which can be directly linked to <strong>the</strong> clinical manifestations and outcome<br />
<strong>of</strong> a particular disease.<br />
After identification <strong>of</strong> a disease specific set <strong>of</strong> biomarkers subsequent studies<br />
can be designed to eventually find drugs that can target <strong>the</strong>se biomarkers<br />
and hopefully open up new steps <strong>for</strong> actually curing this disease.<br />
However, until this vision becomes reality <strong>the</strong>re are some open problems<br />
that need fur<strong>the</strong>r investigation to allow <strong>for</strong> a deeper understanding and thus reliable<br />
and comprehensible mass-spectrometry based clinical diagnostics. These<br />
open problems include:<br />
169