13.01.2013 Views

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Multiple Sequence Alignment 241<br />

Table 3<br />

Summary of Sequence Retrieval Statistics for <strong>Calcium</strong>-<strong>Binding</strong><br />

Domains (Number of Sequences)<br />

Pfam PROSITE After 50% cut-off<br />

EF-hand 790 479 188<br />

C2 domain 109 50 60<br />

Annexin 171 54 37<br />

tiple alignment, although it must be kept in mind that the biological significance<br />

of a particular tree is not addressed.<br />

A researcher wishing to derive a phylogeny from a given set of protein<br />

sequences, should first seek one or more sensitive multiple alignment routines to<br />

construct a good alignment. If secondary or tertiary structures are known, the<br />

information should be applied to check and enhance the alignment. A variety of<br />

phylogenetic methods should then be used (preferably including the NJ method)<br />

in conjunction with bootstrapping and the results compared carefully for consistency.<br />

Paradoxally, many wrong trees that can be derived from a particular sequence<br />

set will lead to far more interesting phylogenies than the one correct tree.<br />

4.2. Specific Practical Examples Using <strong>Calcium</strong>-<strong>Binding</strong> <strong>Protein</strong>s<br />

In order to provide some practical insight into the methodology outlined<br />

above, example alignments of the EF hand, C2 domain, and annexin families<br />

are provided below. The results presented are not intended to provide conclusive<br />

results of any sort, but to demonstrate a general methodology, simple<br />

interpretations of results, possible pitfalls and ways to avoid them. The<br />

emphasis is on alignment of sequences with large identity differences, and on<br />

contrasting the performance of different alignment methods (exemplified here<br />

by the PRALINE and CLUSTALX algorithms) in the alignment of the various<br />

families.<br />

4.2.1. Methodology<br />

The basic procedure used was as described in Subheading 3., and this section<br />

is included to provide some example of the scope and procedures involved<br />

in these alignments. Sequences were extracted from the PROSITE and PFAM<br />

databases using the terms “EF hand,” “C2,” and “annexin” via the SRS interface.<br />

Table 3 provides a summary of the number of sequences available from<br />

each. Because of the variant nature in sequence length and the presence of<br />

repeats, it was decided that the alignments should be performed for the calciumbinding<br />

regions only, with each calcium-binding site having its own sequence<br />

entry. For each of the alignments, the sequences in these two databases were

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!