01.04.2015 Views

Gene Cloning

Gene Cloning

Gene Cloning

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

226 <strong>Gene</strong> <strong>Cloning</strong><br />

The problem with trying to perform a pair-wise alignment, between<br />

your query sequence and all of the sequences in one of the databases, is<br />

that the databases are too big for it to be practical to perform such an<br />

exhaustive search. However, a number of computational methods have<br />

been developed which will perform a very good approximation to this type<br />

of search. The two most commonly used programs are Basic Local<br />

Alignment Search Tool (BLAST) and Fasta (it stands for FAST-All). Each of<br />

these programs uses a short cut to allow it to produce a list of similar<br />

sequences without performing an exhaustive alignment between the query<br />

sequence and every entry in the database. There are numerous web-based<br />

resources which allow you to run Fasta and BLAST searches. The addresses<br />

for some of these can be found in the Further Reading section at the end of<br />

this chapter.<br />

A classical example, which demonstrates the power of this technique, is<br />

the case of the gene responsible for cystic fibrosis. In this inherited disease,<br />

thick mucus accumulates in the lungs and intestines, which results in frequent<br />

lung infections and malnutrition. Patients also have high levels of<br />

salt in their sweat and are often sterile. Intensive efforts to clone and<br />

sequence the gene responsible culminated in 1989 in the publication, by a<br />

large team led by Lap-Chee Tsui and John Riordan, of the sequence of the<br />

CFTR gene; mutations in this gene give rise to cystic fibrosis. Performing a<br />

similarity search with the CFTR gene today will yield a huge number of similar<br />

sequences, as it belongs to a much studied family of membrane transporter<br />

proteins. However, in the late 1980s the situation was very different<br />

because there were far fewer known sequences. Homology searches at the<br />

time showed that, somewhat surprisingly, the gene responsible for this<br />

genetic disease in humans was similar to a family of proteins responsible<br />

for resistance of bacteria to a wide range of antibacterial agents. These ABC<br />

multidrug transporter proteins function by pumping the drugs out of the<br />

bacteria. This was an important clue as to the function of the CFTR protein,<br />

which is now known to be involved in pumping sodium and chloride ions<br />

across membranes in mammalian cells.<br />

8.10 Fasta<br />

Typically, the output from a Fasta search produces a summary table such<br />

as that shown in Figure 8.12a. The table lists the sequences from the database<br />

which most closely match that of the query sequence. The first column<br />

shows the name of the entry in the database, or database identifier,<br />

followed by an extract from the description of the protein. Many web interfaces<br />

for the Fasta program provide hyperlinks to the database entries so<br />

that you have immediate access to a wealth of information about the proteins.<br />

The other columns give statistical information, which enables you to<br />

assess the match between your query and the database entry. In the example<br />

shown you can see that most of the sequences are of a similar length to

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!