01.04.2015 Views

Sequence Comparison.pdf

Sequence Comparison.pdf

Sequence Comparison.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

7.4 BLAST Database Search 143<br />

...<br />

Score = 36.6 bits (83), Expect = 0.11, Method: Compositional matrix adjust.<br />

Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 9/86 (10%)<br />

Query 1 MSLPIIKVH-WLDHSRAFRLLWLLDHLNLEYEIVPYKR-DANFRAPPELKKIHPLGRSPL 58<br />

M+ P +KV+ W R L L+ ++YE+VP R D + R P L + +P G+ P+<br />

Sbjct 1 MATPAVKVYGWAISPFVSRALLALEEAGVDYELVPMSRQDGDHRRPEHLAR-NPFGKVPV 59<br />

Query 59 LEVQDRETGKKKILAESGFIFQYVLQ 84<br />

LE D L ES I ++VL+<br />

Sbjct 60 LEDGDL------TLFESRAIARHVLR 79<br />

...<br />

Database: Non-redundant SwissProt sequences<br />

Posted date: May 23, 2008 5:56 PM<br />

Number of letters in database: 124,438,792<br />

Number of sequences in database: 332,988<br />

Lambda K H<br />

0.320 0.137 0.401<br />

Gapped<br />

Lambda K H<br />

0.267 0.0410 0.140<br />

Matrix: BLOSUM62<br />

Gap Penalties: Existence: 11, Extension: 1<br />

Number of <strong>Sequence</strong>s: 332988<br />

...<br />

Length of query: 234<br />

Length of database: 124438792<br />

Length adjustment: 111<br />

...<br />

The printout above shows that the query sequence has length 234, the number of<br />

letters in database is 124,438,792, the number of sequences in database is 332,988,<br />

and the length adjustment is 111. The Maize Glutathione match listed in the printout<br />

contains gaps. Hence,<br />

Because the raw is 83, the bit score is<br />

λ = 0.267, K = 0.041.<br />

[0.267 − ln(0.041)]/ln(2) ≈ 36.58,<br />

in agreement with the printout value. The value of Expect is<br />

0.041 × (234 − 111) × (124438793 − 332988 × 111) × e −0.267×83 ≈ 0.105,<br />

in agreement with the value 0.11 in the printout. Finally, one can easily check that<br />

the length adjustment 111 is an approximate fixed point of the function in (7.62).<br />

Another partial printout from Wu-BLASTP search against a protein database<br />

is given below, in which only high-scoring alignments between the query and a<br />

database sequence are listed. There are eight alignments in total. For each alignment,<br />

the Karlin-Altschul sum P-value and E-value are reported. We number the

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!