11.04.2013 Views

BLAST and BLAT - Algorithms in Bioinformatics

BLAST and BLAT - Algorithms in Bioinformatics

BLAST and BLAT - Algorithms in Bioinformatics

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Bio<strong>in</strong>formatics I, WS’12/13, D. Huson, October 30, 2012 23<br />

3.6 Available <strong>BLAST</strong> implementations<br />

Implementations:<br />

• NCBI <strong>BLAST</strong>: Implementation of all <strong>BLAST</strong> programs ma<strong>in</strong>ta<strong>in</strong>ed by NCBI.<br />

• AB-<strong>BLAST</strong> (former WU-<strong>BLAST</strong>): Alternative implementation of all <strong>BLAST</strong> programs (except<br />

for PHI- <strong>and</strong> PSI-<strong>BLAST</strong>), needs a license (which costs money)<br />

• There are a number of commercial implementations of <strong>BLAST</strong> on GPUs <strong>and</strong> specialized hardware<br />

3.7 <strong>BLAT</strong><br />

<strong>BLAT</strong> = Blast Like Alignment Tool 3<br />

<strong>BLAT</strong> is an alternative to <strong>BLAST</strong> that was developed <strong>in</strong> the context of the Human Genome Project.<br />

Its targets the comparison of genomes (e.g. human <strong>and</strong> mouse) or mRNA/DNA <strong>and</strong> cross-species<br />

prote<strong>in</strong> alignments.<br />

Task that motivated development of <strong>BLAT</strong>: Map three million ESTs 4 (1.7 Gb) <strong>and</strong> 13 million whole<br />

gnome shotgun reads to a draft sequence of the human genome (2.88 Gb).<br />

3.8 <strong>BLAT</strong> vs <strong>BLAST</strong><br />

<strong>BLAT</strong> is similar to <strong>BLAST</strong>: The program rapidly scans for relatively short matches (hits) <strong>and</strong> extends<br />

these <strong>in</strong>to HSPs. However <strong>BLAT</strong> differs from <strong>BLAST</strong> <strong>in</strong> some important ways:<br />

• <strong>BLAST</strong> builds an <strong>in</strong>dex of the query str<strong>in</strong>g <strong>and</strong> then scans l<strong>in</strong>early through the database – <strong>BLAT</strong><br />

builds an <strong>in</strong>dex of the database <strong>and</strong> then scans l<strong>in</strong>early through the query,<br />

• <strong>BLAST</strong> triggers an extension when one or two hits occur – <strong>BLAT</strong> can trigger extensions on any<br />

given number of perfect or near perfect seed matches,<br />

• <strong>BLAST</strong> returns each area of homology as separate alignments – <strong>BLAT</strong> cha<strong>in</strong>s them together<br />

<strong>in</strong>to larger alignments,<br />

• <strong>BLAST</strong> delivers a list of exons sorted by size, with alignments extend<strong>in</strong>g slightly beyond the<br />

edge of each exon – <strong>BLAT</strong> “unsplices” mRNA onto the genome, giv<strong>in</strong>g a s<strong>in</strong>gle alignment that<br />

uses each base of the mRNA only once, with correctly positioned splice sites.<br />

3.9 Sensitivity <strong>and</strong> Specificity<br />

Classifications:<br />

An “event” or “signal” (such as a DNA sequences is orthologous to a second one, a given DNA sequence<br />

is conta<strong>in</strong>ed <strong>in</strong> a given cod<strong>in</strong>g region, or a gene is differentially expressed etc.) can<br />

• be predicted to occur: Predicted Positive<br />

• be predicted not to occur: Predicted Negative<br />

3 W. J. Kent: <strong>BLAT</strong> - The <strong>BLAST</strong>-Like Alignment Tool. Genome Res. 12:656-664(2002)<br />

4 An expressed sequence tag or EST is a short sub-sequence of a cDNA sequence used <strong>in</strong> the <strong>in</strong>vestigation of genes.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!