You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
7.4 BLAST Database Search 143<br />
...<br />
Score = 36.6 bits (83), Expect = 0.11, Method: Compositional matrix adjust.<br />
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 9/86 (10%)<br />
Query 1 MSLPIIKVH-WLDHSRAFRLLWLLDHLNLEYEIVPYKR-DANFRAPPELKKIHPLGRSPL 58<br />
M+ P +KV+ W R L L+ ++YE+VP R D + R P L + +P G+ P+<br />
Sbjct 1 MATPAVKVYGWAISPFVSRALLALEEAGVDYELVPMSRQDGDHRRPEHLAR-NPFGKVPV 59<br />
Query 59 LEVQDRETGKKKILAESGFIFQYVLQ 84<br />
LE D L ES I ++VL+<br />
Sbjct 60 LEDGDL------TLFESRAIARHVLR 79<br />
...<br />
Database: Non-redundant SwissProt sequences<br />
Posted date: May 23, 2008 5:56 PM<br />
Number of letters in database: 124,438,792<br />
Number of sequences in database: 332,988<br />
Lambda K H<br />
0.320 0.137 0.401<br />
Gapped<br />
Lambda K H<br />
0.267 0.0410 0.140<br />
Matrix: BLOSUM62<br />
Gap Penalties: Existence: 11, Extension: 1<br />
Number of <strong>Sequence</strong>s: 332988<br />
...<br />
Length of query: 234<br />
Length of database: 124438792<br />
Length adjustment: 111<br />
...<br />
The printout above shows that the query sequence has length 234, the number of<br />
letters in database is 124,438,792, the number of sequences in database is 332,988,<br />
and the length adjustment is 111. The Maize Glutathione match listed in the printout<br />
contains gaps. Hence,<br />
Because the raw is 83, the bit score is<br />
λ = 0.267, K = 0.041.<br />
[0.267 − ln(0.041)]/ln(2) ≈ 36.58,<br />
in agreement with the printout value. The value of Expect is<br />
0.041 × (234 − 111) × (124438793 − 332988 × 111) × e −0.267×83 ≈ 0.105,<br />
in agreement with the value 0.11 in the printout. Finally, one can easily check that<br />
the length adjustment 111 is an approximate fixed point of the function in (7.62).<br />
Another partial printout from Wu-BLASTP search against a protein database<br />
is given below, in which only high-scoring alignments between the query and a<br />
database sequence are listed. There are eight alignments in total. For each alignment,<br />
the Karlin-Altschul sum P-value and E-value are reported. We number the