13.01.2013 Views

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

238 Weljie and Heringa<br />

these databases). Similar functionality to SRS is found in other tools such as the<br />

Entrez system.<br />

3. In certain cases, it may not be useful to align the entire sequences such as with<br />

sequences of differing size or containing repeats. For example, if one uses a global<br />

alignment routine and has sequences that vary from less than 100 amino acids<br />

to greater than 1000, the program will generally distort the shorter sequences in<br />

attempting to create the global alignments. In this case, it might be advantageous<br />

to use a local multiple alignment method, such as DIALIGN (47). A local pairwise<br />

alignment tool such as BLAST can also be used to find local matches, and only<br />

these portions of the sequence of interest can then be aligned using a global alignment<br />

routine. This might be particularly appropriate in examining EF-hand or<br />

C2-domain proteins from different subfamilies, as these sequences also contain a<br />

variety of noncalcium-binding domains. The well-annotated SWISSPROT database<br />

will give the specific residues for each entry belonging to a particular<br />

domain, and this can be useful for extracting only the calcium-binding domains.<br />

It should be noted that for reliable alignments, smaller sequences (approx 30 amino<br />

acids) generally require slightly higher identities (>25 %) for reliable alignments,<br />

as compared to longer sequences. Repeats are also difficult as there may be certain<br />

sections of proteins that align better with incorrect topology (e.g., the C-terminal<br />

end of one sequence with the N-terminus of another).<br />

4. The amino substitution weights are normally given as a 20 × 20 matrix, containing<br />

the weights for all possible amino acid exchanges. A multitude of such matrices<br />

are available, of which the most widely used are the classic PAM250 (50), the<br />

BLOSUM62 (27), and the Gonnet (51) matrix. The insertion/deletion penalties<br />

are used to decrease the alignment score when gaps need to be made to optimally<br />

match the two sequences. Normally, a pair of gap penalties is used, consisting of<br />

an opening penalty used once for each gap and an extension penalty applied to<br />

each incurring gap position. In practice, recommended values for the gap penalties<br />

are available for most amino acid exchange matrices (such as 10 and 1 for the<br />

PAM250 or 12 and 1 for the BLOSUM62 matrix, for gap opening and extension,<br />

respectively). For divergent sequences, lowered settings for the gap penalties<br />

could be attempted. Some alignment programs, such as CLUSTAL or MULTAL<br />

(52) employ optimized combinations of exchange matrices with associated gap<br />

penalties, which makes it less straightforward to vary gap penalties and/or try<br />

other residue exchange matrices.<br />

5. Although multiple alignments can confer a wealth of information, numerous<br />

avenues of further analysis are open to the researcher in which an alignment plays<br />

a key role. It should be mentioned, however, that each mode of analysis often<br />

requires specific types of sequences to be included in the alignment together with<br />

associated information. For example, homology modeling requires sequences<br />

whose 3-D coordinates are known, and some types of phylogenetic trees require<br />

outlier sequences to be included. The successful generation of alignments useful<br />

in these applications may require the researcher to perform the alignment several<br />

times in an iterative fashion before conditions are fully satisfied, albeit care in

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!