Applied Biosystems SOLiD™ 4 System SETS Software User Guide ...
Applied Biosystems SOLiD™ 4 System SETS Software User Guide ...
Applied Biosystems SOLiD™ 4 System SETS Software User Guide ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
SNPs and errors<br />
High accuracy for SNP detection<br />
<strong>Applied</strong> <strong>Biosystems</strong> SOLiD 4 <strong>System</strong> <strong>SETS</strong> <strong>Software</strong> <strong>User</strong> <strong>Guide</strong><br />
Appendix B Advanced Topic: Data Analysis Overview<br />
Data analysis considerations B<br />
Because a SNP generates two adjacent and valid color-space<br />
mismatches to the reference sequence, the software allows a<br />
minimum of two color-space mismatches during alignment to the<br />
reference sequence. Depending on the complexity of the alignment,<br />
the optimum number of mismatches in alignment is at least three. If<br />
only two errors are allowed, then whenever there is a SNP (two<br />
mismatches) and a measurement error, the third error causes<br />
omission of that read. You can override the omission by using an<br />
alignment program setting that counts two adjacent mismatches as a<br />
single mismatch. Therefore, two adjacent mismatches and a<br />
measurement error are classified as two errors, whereas three nonadjacent<br />
errors are classified as three errors.<br />
2-base encoding provides high system accuracy and built-in error<br />
checking because it enables discrimination between measurement<br />
errors and true sequence polymorphisms. The SOLiD software<br />
interrogates each base twice in two independent reactions.<br />
Information about each base is included in two adjacent pieces of<br />
color-space data. Because the system uses four fluorescent dyes,<br />
there are 16 possible two-color combinations (Figure 7).<br />
Figure 7 Color-space transitions<br />
153