13.01.2013 Views

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

Calcium-Binding Protein Protocols Calcium-Binding Protein Protocols

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Multiple Sequence Alignment 239<br />

preparing the alignment is certainly worth the effort. During the course of such<br />

optimizations, it is also worthwhile considering the incorporation of external<br />

information garnered from biochemical or structural studies of the proteins in the<br />

alignment.<br />

6. Three basic concepts exist to reconstruct the divergent evolution of a set of<br />

sequences: (a) parsimony (53,54); (b) distance (55); and (c) maximum likelihood<br />

methods (56). Parsimony methods try to reconstruct phylogenies by exploring<br />

the concept of minimum mutation. Distance methods are aimed at exploring a<br />

matrix containing all pairwise distances of a set of multiply aligned sequences.<br />

These methods also try to reconstruct the past using a minimalist approach; i.e.,<br />

using as few evolutionary changes as possible. Maximum-likelihood methods<br />

attempt to construct the most probable tree based on the sequence data and a<br />

specific stochastic model of evolution. The additional information that can be<br />

expressed in the model, such as weighting of functional similarity and amino<br />

acid importance and the nature of insertions/deletions (e.g., ref. 10), can optimize<br />

the connectivity and branch lengths of the resulting tree. The package<br />

MOLPHY (57) is a speed-optimized maximum likelihood method that could be<br />

considered if evolutionary information is the most important analysis. In this<br />

chapter, we will restrict ourselves to distance-based methods as they are quick<br />

and can be applied easily onto the sequence data.<br />

The evolutionary relationships of a subject set of sequences are normally<br />

depicted in a tree. A tree is a special case of a connected graph where travel from<br />

each node to any other is possible through edges (branches) by only one path<br />

between any two such nodes. A tree contains interior and exterior (terminal)<br />

nodes. Normally, the input sequences are contemporary and referred to as the<br />

operational taxonomic units (OTUs). They correspond with the exterior nodes of<br />

the evolutionary tree, whereas the internal nodes represent ancestral sequences<br />

that must be guessed from the OTUs and the tree topology. The length of each<br />

branch connecting a pair of nodes may correspond to the estimated number of<br />

substitutions between two associated sequences. The minimal evolution hypothesis<br />

is that the “true” phylogenetic tree is the rooted tree (i.e., contains a node<br />

ancestral to all other nodes), which has the shortest overall length and thus comprises<br />

the lowest cumulative number of mutations.<br />

Distance methods derive a tree from a distance matrix, in which approximations<br />

are stored of all pairwise evolutionary distances between the tree constituents<br />

(i.e., the sequences). Distances can be obtained from sequence identities<br />

(55) or pairwise sequence alignment scores (58). Normally, an agglomerative<br />

cluster criterion is then used to construct the phylogenetic tree, reflecting the evolutionary<br />

information in the best possible way. Many clustering criteria have been<br />

introduced over the years and each has an underlying assumption of evolutionary<br />

dynamics. The first cluster method used in molecular sequence phylogeny was<br />

the UPGMA or group-averaging method (38). It takes the average value over all<br />

intergroup distances to measure the evolutionary distance between two groups of<br />

sequences and has the underlying assumption of identical mutation rates in all

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!