12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

72 A. Fiserfold, such as the hypervariable regions in the immunoglobulin fold (Chothia et al.1989). Earlier it was predicted that it is unlikely that structure databanks will everreach a point when fragment based approaches become efficient <strong>to</strong> model loops(Fidelis et al. 1994), which resulted in a boost in the development of conformationalsearch approaches from around 2000. However, many details of the fold universehas been explored during the last decade due <strong>to</strong> the large number of new foldssolved experimentally, which had a profound effect on the extent of known structuralfragments. Recent analyses showed that loop fragments are not only well representedin current structure databanks but shorter segments are possibly completelyexplored already (Du et al. 2003). It was reported that sequence segments up <strong>to</strong> tenresidues had a related (i.e. at least 50% identical segment) in PDB <strong>with</strong> a knownconformation, and despite the six fold increase in sequence databank size and thedoubling of PDB since 2002 there was not a single unique loop conformationentered in the PDB or sequence segment observed that shares less than 50%sequence identity <strong>to</strong> a PDB fragment, which indicates that newly sequenced proteinskeep recycling the same set of already known short structural segments. Allsequence segments up <strong>to</strong> 10–12 residues have at least one corresponding structuralsegment that shares at least 50% identity thus ensuring structural similarity, excepta very few notable exceptions mentioned above (Fernandez-Fuentes and Fiser2006). Consequently more recent efforts have tried <strong>to</strong> classify loop conformationsin<strong>to</strong> more general categories, thus extending the applicability of the database searchapproach for more cases (Fernandez-Fuentes et al. 2006a; Michalsky et al. 2003).A recent work described the advantage of using HMM sequence profiles in classifyingand predicting loops (Espadaler et al. 2004). An another recently publishedloop prediction approach first predicts conformation for a query loop sequence andthen structurally aligns the predicted structural fragments <strong>to</strong> a set of non-redundantloop structural templates. These sequence-template loop alignments are then quantitativelyevaluated <strong>with</strong> an artificial neural network model trained on a set of predictions<strong>with</strong> known outcomes (Peng and Yang 2007).ArchPred, perhaps the most accurate database loop modelling approach, isbriefly described here (Fernandez-Fuentes et al. 2006a, b). ArchPred exploits ahierarchical and multidimensional database that has been set up <strong>to</strong> classify about300,000 loop fragments and loop flanking secondary structures. Besides the lengthof the loops and types of bracing secondary structures the database is organizedalong four internal coordinates, a distance and three types of angles characterizingthe geometry of stem regions (Oliva et al. 1997). Candidate fragments are selectedfrom this library by matching the length, the types of bracing secondary structuresof the query and satisfying the geometrical restraints of the stems and subsequentlyinserted in the query protein framework where their fit is assessed by the root meansquared deviation (RMSD) of stem regions and by the number of rigid body clashes<strong>with</strong> the environment. In the final step, remaining candidate loops are ranked by aZ-score that combines information on sequence similarity and fit of predicted andobserved φ/ψ main chain dihedral angle propensities. Confidence Z-score cut-offswere determined for each loop length that identify those predicted fragments tha<strong>to</strong>utperform a competitive ab initio method. A web server implements the method,

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!