29.01.2013 Views

Applied Biosystems SOLiD™ 4 System SETS Software User Guide ...

Applied Biosystems SOLiD™ 4 System SETS Software User Guide ...

Applied Biosystems SOLiD™ 4 System SETS Software User Guide ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Sampling<br />

Allele ratio<br />

<strong>Applied</strong> <strong>Biosystems</strong> SOLiD 4 <strong>System</strong> <strong>SETS</strong> <strong>Software</strong> <strong>User</strong> <strong>Guide</strong><br />

Appendix B Advanced Topic: Data Analysis Overview<br />

Data analysis considerations B<br />

If coverage at a genome position is low, the second allele might often<br />

not be sampled, even if it is present. Heterozygotes are expected to be<br />

underrepresented at low coverage. (They can be called as a<br />

homozygote for one of the two alleles.) It is important to have a low<br />

false positive rate for SNP detection, because SNPs are expected to<br />

exist at only 1 in 1,000 positions for humans and some other species.<br />

The false positive rate in an individual species should be at least an<br />

order of magnitude lower than this, to avoid a high false discovery<br />

rate. An error model that incorporates quality values (QV) can<br />

increase the overall accuracy of SNP detection. The SNP detection<br />

algorithms of the SOLiD 4 <strong>System</strong> use explicit error models<br />

generated from each run and biologically meaningful prior<br />

probabilities to evaluate evidence of a SNP. 2-base encoding provides<br />

a built-in way to distinguish errors from true alleles to manage the<br />

false positive rate.<br />

If the sample preparation method or some other cause produces<br />

results in allele ratios that are considerably different from the<br />

expected 50:50 ratio, it is more difficult to detect heterozygosity.<br />

Detecting heterozygosity then requires more sequence coverage.<br />

Find polymorphisms in color-space<br />

The SOLiD 4 <strong>System</strong> can detect complicated genomic variations<br />

such as adjacent SNPs, insertions, deletions, and structural<br />

rearrangements.<br />

For other analysis tools, visit the AB SOLiD <strong>Software</strong> Community<br />

website:<br />

http://www3.appliedbiosystems.com/AB_Home/applicationstechnol<br />

ogies/SOLiD<strong>System</strong>Sequencing/<strong>Software</strong>CommunityDataAnalysis<br />

ResourcesforScientistsDevelopers/index.htm<br />

An example of various polymorphisms in color-space is shown in the<br />

figure below.<br />

155

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!