15.12.2012 Views

Bioinformatics, Volume I Data, Sequence Analysis and Evolution

Bioinformatics, Volume I Data, Sequence Analysis and Evolution

Bioinformatics, Volume I Data, Sequence Analysis and Evolution

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

92 Durinck<br />

1.5. <strong>Data</strong> Filtering<br />

1.6. Detection of<br />

Differentially<br />

Expressed Genes Genes<br />

This type of normalization can be used under the assumption<br />

that only a small subset of genes is differentially expressed<br />

between the two samples. If this assumption does not hold, one<br />

can rely on spiked-in RNA to calculate the lowess fit <strong>and</strong> use<br />

that fit to adjust the data (11). Other frequently used methods<br />

to normalize cDNA microarrays are qspline (12) <strong>and</strong> VSN (7)<br />

normalization.<br />

A between-slide normalization, which corrects for a difference<br />

in scale, can be applied if the variance of the ratios differs a lot<br />

between the different slides. A boxplot of the ratios grouped per<br />

slide will reveal the necessity of such between-slide normalization.<br />

ANOVA (13) is an alternative to the types of normalization<br />

methods described in the preceding <strong>and</strong> aims to estimate the<br />

size of different effects, including dye, gene, array, <strong>and</strong> sample.<br />

Changes in gene expression across the samples are estimated by<br />

the sample x gene interaction terms of the model (13).<br />

Prior to detection of differentially expressed genes, a data-filtering<br />

step can be applied. It is recommended to eliminate all genes<br />

that are not expressed over all samples.<br />

For cDNA microarrays, genes for which the foreground<br />

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!