12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

42 L.A. Kelleyperformance at CASP4 of several groups who outperformed previously successfulthreading groups using PSI-BLAST or some variant.The key <strong>to</strong> the success of the PSI-BLAST approach lies in a realisation thatevery position in a protein sequence will be under different evolutionary pressures.For example, a glycine in one position may be highly conserved as it isrequired for a particularly tight turn of the protein chain <strong>to</strong> maintain its <strong>to</strong>pology.Any mutation in this glycine may be lethal as the protein would fail <strong>to</strong> foldcorrectly. A different glycine elsewhere in the sequence may be in a highly variableloop region under minimal selection pressure. Thus when aligning a querysequence against this structure, the first glycine must be present, but the secondone may vary. It is this position-specific mutational propensity that permits farmore sensitive remote homology detection.A typical use for PSI-BLAST-generated profiles is where the profile for a querysequence is scanned against a database of sequences from the PDB, or conversely,a query sequence is scanned against a library of template profiles. The profilesthemselves need not come from PSI-BLAST. Profile Hidden Markov Models(HMMs) are built from multiple sequence alignments but include more informationthan a standard profile, including the positions of common insertions anddeletions, and transition probabilities <strong>to</strong> and from match states <strong>to</strong> each position.Once again this was often coupled <strong>to</strong> the use of predicted structural features suchas secondary structure. The alternative sequence-profile and profile-sequenceapproaches are shown schematically in Figs. 2.4c and d.Improved profiles and HMMs can be built by using structure alignments ofremote homologues and by adding sequences of unknown structure that can be easilyaligned <strong>with</strong> each structure (Kelley et al. 2000; Tang et al. 2003). However, usingstructure alignments <strong>to</strong> build better profiles has often resulted in modest improvementsin remote homologue detection or alignment accuracy. This is probably due<strong>to</strong> the non-uniqueness of sequence alignments generated from structure alignments,especially in the vicinity of large insertions or deletions or significant structuralchanges. These may result in misalignments between sets of sequences related <strong>to</strong>each structure. The solution of Zhou and Zhou (2005) was <strong>to</strong> generate fragments ofproteins and use these <strong>to</strong> build profiles in their successful SP3 method.Hidden Markov Models have been used extensively by a variety of groups <strong>to</strong> goodeffect in recent years. As mentioned above, one of their key advantages over the relativelysimpler profiles generated by PSI-BLAST is the presence of extra informationregarding gaps and neighbouring residues. However, for both profiles and HMMs,the multiple sequence alignment from which they are derived is of key importance.The sequences used and the quality of the alignment are probably more important <strong>to</strong>the power of the profile than subtleties regarding the statistical handling of the alignmentin generating the profile. As a result many groups have found it useful <strong>to</strong> gatherhomologous sequences using PSI-BLAST, but use a more powerful multiple sequencealignment program <strong>to</strong> generate a more accurate multiple alignment.As a generalisation of sequence-profile alignments or sequence-HMM comparisons,profile-profile and HMM-HMM alignments have recently demonstrated significantlysuperior performance. Thus instead of using profiles (or HMMs) for only the

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!