New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Extended Abstract<br />
English Version<br />
<strong>Mass</strong> spectrometry (MS) based techniques have emerged as a standard <strong>for</strong><br />
large-scale protein analysis. The ongoing progress in terms <strong>of</strong> more sensitive MS standard <strong>for</strong> large scale protein<br />
analysis<br />
machines and improved data analysis algorithms led to a constant expansion <strong>of</strong><br />
its fields <strong>of</strong> applications. Recently, MS was introduced into clinical proteomics<br />
with <strong>the</strong> prospect <strong>of</strong> early disease detection using proteomic pattern matching. MS <strong>for</strong> early disease detection<br />
Analyzing biological samples (e.g. blood) by mass spectrometry generates<br />
mass spectra that represent <strong>the</strong> components (molecules) contained in a sample<br />
as masses and <strong>the</strong>ir respective relative concentrations. It is well known that <strong>Mass</strong> spectra represent relative<br />
an individual’s proteome is highly dynamic and changes quite dramatically<br />
during a day, depending on a variety <strong>of</strong> factors. However, analyzing a large<br />
enough group <strong>of</strong> similar individuals (e.g. “healthy” or “suffering from disease<br />
concentrations <strong>of</strong> molecules in a<br />
sample<br />
X”) allows to identify components in <strong>the</strong> respective spectra that do not differ An individual’s proteome is highly<br />
much - with respect to concentration - between individuals from <strong>the</strong> same<br />
group (constant components).<br />
In this work, we are interested in those components that are constant<br />
within a group <strong>of</strong> individuals but differ much between individuals <strong>of</strong> two distinct<br />
groups. These distinguishing components that dependent on a particular<br />
medical condition are generally called biomarkers. Since not all biomarkers<br />
dynamic and influenced by e.g.<br />
diseases<br />
found by <strong>the</strong> algorithms are <strong>of</strong> equal (discriminating) quality we are only in- In this <strong>the</strong>sis: search <strong>for</strong> spectra<br />
terested in a small biomarker subset that - as a combination - can be used<br />
components that reflect particular<br />
diseases<br />
as a fingerprint <strong>for</strong> a disease. Once a fingerprint <strong>for</strong> a particular disease (or Best components can be combined to<br />
a disease’s fingerprint<br />
medical condition) is identified, it can be used in clinical diagnostics to classify<br />
unknown spectra.<br />
This mass spectrometry based method appears to be one <strong>of</strong> <strong>the</strong> arising key<br />
technologies <strong>for</strong> biomarker discovery, understanding <strong>of</strong> biological mechanisms,<br />
and consequently, it might <strong>of</strong>fer new approaches in drug development.<br />
In this <strong>the</strong>sis we have developed new algorithms <strong>for</strong> automatic extraction<br />
<strong>of</strong> disease specific fingerprints from mass spectrometry data. Special empha- <strong>New</strong> algorithms <strong>for</strong> automatic<br />
sis has been put on designing highly sensitive methods with respect to signal<br />
extraction <strong>of</strong> fingerprints have been<br />
developed<br />
detection. This is extremely important in all stages <strong>of</strong> <strong>the</strong> pipeline (such as Focus was laid on developing highly<br />
spectra preprocessing, signal detection, signal analysis and identification <strong>of</strong><br />
disease specific fingerprints) since many biologically relevant molecules are<br />
found to be very low abundant (such as hormones) thus yielding (comparatively)<br />
small signals. Thanks to our statistically based approach our methods<br />
are able to detect signals even below <strong>the</strong> noise level inherent in data acquired<br />
by common MS machines.<br />
To provide access to <strong>the</strong>se new classes <strong>of</strong> algorithms to collaborating groups<br />
we have created a web-based analysis plat<strong>for</strong>m that provides all necessary<br />
sensitive algorithms to allow<br />
detection <strong>of</strong> low abundant molecules<br />
within noise<br />
interfaces <strong>for</strong> data transfer, data analysis and result inspection. Following A new analysis plat<strong>for</strong>m <strong>for</strong> signal<br />
(pre-)processing and extraction <strong>of</strong><br />
disease specific fingerprints was<br />
developed<br />
1