BLAST and BLAT - Algorithms in Bioinformatics
BLAST and BLAT - Algorithms in Bioinformatics
BLAST and BLAT - Algorithms in Bioinformatics
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Bio<strong>in</strong>formatics I, WS’12/13, D. Huson, October 30, 2012 23<br />
3.6 Available <strong>BLAST</strong> implementations<br />
Implementations:<br />
• NCBI <strong>BLAST</strong>: Implementation of all <strong>BLAST</strong> programs ma<strong>in</strong>ta<strong>in</strong>ed by NCBI.<br />
• AB-<strong>BLAST</strong> (former WU-<strong>BLAST</strong>): Alternative implementation of all <strong>BLAST</strong> programs (except<br />
for PHI- <strong>and</strong> PSI-<strong>BLAST</strong>), needs a license (which costs money)<br />
• There are a number of commercial implementations of <strong>BLAST</strong> on GPUs <strong>and</strong> specialized hardware<br />
3.7 <strong>BLAT</strong><br />
<strong>BLAT</strong> = Blast Like Alignment Tool 3<br />
<strong>BLAT</strong> is an alternative to <strong>BLAST</strong> that was developed <strong>in</strong> the context of the Human Genome Project.<br />
Its targets the comparison of genomes (e.g. human <strong>and</strong> mouse) or mRNA/DNA <strong>and</strong> cross-species<br />
prote<strong>in</strong> alignments.<br />
Task that motivated development of <strong>BLAT</strong>: Map three million ESTs 4 (1.7 Gb) <strong>and</strong> 13 million whole<br />
gnome shotgun reads to a draft sequence of the human genome (2.88 Gb).<br />
3.8 <strong>BLAT</strong> vs <strong>BLAST</strong><br />
<strong>BLAT</strong> is similar to <strong>BLAST</strong>: The program rapidly scans for relatively short matches (hits) <strong>and</strong> extends<br />
these <strong>in</strong>to HSPs. However <strong>BLAT</strong> differs from <strong>BLAST</strong> <strong>in</strong> some important ways:<br />
• <strong>BLAST</strong> builds an <strong>in</strong>dex of the query str<strong>in</strong>g <strong>and</strong> then scans l<strong>in</strong>early through the database – <strong>BLAT</strong><br />
builds an <strong>in</strong>dex of the database <strong>and</strong> then scans l<strong>in</strong>early through the query,<br />
• <strong>BLAST</strong> triggers an extension when one or two hits occur – <strong>BLAT</strong> can trigger extensions on any<br />
given number of perfect or near perfect seed matches,<br />
• <strong>BLAST</strong> returns each area of homology as separate alignments – <strong>BLAT</strong> cha<strong>in</strong>s them together<br />
<strong>in</strong>to larger alignments,<br />
• <strong>BLAST</strong> delivers a list of exons sorted by size, with alignments extend<strong>in</strong>g slightly beyond the<br />
edge of each exon – <strong>BLAT</strong> “unsplices” mRNA onto the genome, giv<strong>in</strong>g a s<strong>in</strong>gle alignment that<br />
uses each base of the mRNA only once, with correctly positioned splice sites.<br />
3.9 Sensitivity <strong>and</strong> Specificity<br />
Classifications:<br />
An “event” or “signal” (such as a DNA sequences is orthologous to a second one, a given DNA sequence<br />
is conta<strong>in</strong>ed <strong>in</strong> a given cod<strong>in</strong>g region, or a gene is differentially expressed etc.) can<br />
• be predicted to occur: Predicted Positive<br />
• be predicted not to occur: Predicted Negative<br />
3 W. J. Kent: <strong>BLAT</strong> - The <strong>BLAST</strong>-Like Alignment Tool. Genome Res. 12:656-664(2002)<br />
4 An expressed sequence tag or EST is a short sub-sequence of a cDNA sequence used <strong>in</strong> the <strong>in</strong>vestigation of genes.