12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

8 3D Motifs 193structures <strong>to</strong>gether. A vertex in the graph represents a pair of a<strong>to</strong>ms or pseudoa<strong>to</strong>ms,one from structure A and one from structure B (where “structure” could be a 3Dmotif). Only a<strong>to</strong>ms <strong>with</strong> matching types are allowed <strong>to</strong> pair. Two vertices are connectedby an edge if the distance between the two a<strong>to</strong>ms in A matches the distancebetween the two a<strong>to</strong>ms in B, <strong>with</strong>in the specified <strong>to</strong>lerance. A clique is a graph inwhich every vertex is connected <strong>to</strong> every other vertex. Thus, clique detection identifiesa set of a<strong>to</strong>ms from A <strong>with</strong> internal distances completely consistent <strong>with</strong>those among a paired set of a<strong>to</strong>ms from B.● Depth-first searching. <strong>Structure</strong>s are searched exhaustively for the presence of a3D motif. The search space is limited by restrictions on the a<strong>to</strong>m or residue typesallowed <strong>to</strong> match, and by geometric cu<strong>to</strong>ffs such as distance <strong>to</strong>lerances and anupper bound on the overall root-mean-square deviation (RMSD).● Match extension. Partial or seed matches <strong>to</strong> a 3D motif are initially identified,and then attempts are made <strong>to</strong> extend the match <strong>to</strong> the entire motif.These methods will all seek motifs in a given static structure. Recent data showthat combining a motif searching algorithm <strong>with</strong> Molecular Dynamics simulationsthat sample conformational space (see also Chapter 9) is a promising, albeit computationallyexpensive, avenue <strong>to</strong>wards improving the results of the search (Glazeret al. 2008).8.2.3 Interpretation of ResultsMany web servers are available for comparing query structures <strong>to</strong> databases of 3Dmotifs (Table 8.1). What can be inferred about the function of a query when itmatches a motif? Several issues must be considered when deciding whether amatch is meaningful, and if so, what it means.What function is implied depends on how the motif was generated. If the motifrepresents a set of residues experimentally determined <strong>to</strong> be important for a particularfunction, a match suggests the query structure may perform that function.Similarly, matches <strong>to</strong> a motif based on multiple structures that perform some function(positive examples) are suggestive of that function, especially if the motif wasdesigned <strong>to</strong> exclude negative examples. What function is implied, however, dependson the criteria for membership in the positive set. For example, if the structures inthe positive set all bind adenosine but catalyze a variety of reactions, a match wouldimply adenosine binding but not necessarily adenosine deaminase activity. If thepositive set consists of representatives from a SCOP superfamily, a match wouldsuggest that the query protein also belongs <strong>to</strong> that superfamily, but would not implyanything more specific about its function.Ideally, for motif discovery the set of positive examples should be as diverseas possible while retaining the common feature, and the negative examplesshould be as similar as possible <strong>to</strong> the positive examples while lacking that feature.In practice, the positive and negative sets may not be ideal, and part or all of

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!