From Protein Structure to Function with Bioinformatics.pdf

More documents

Recommendations

Info

72 A. Fiserfold, such as the hypervariable regions in the immunoglobulin fold (Chothia et al.1989). Earlier it was predicted that it is unlikely that structure databanks will everreach a point when fragment based approaches become efficient to model loops(Fidelis et al. 1994), which resulted in a boost in the development of conformationalsearch approaches from around 2000. However, many details of the fold universehas been explored during the last decade due to the large number of new foldssolved experimentally, which had a profound effect on the extent of known structuralfragments. Recent analyses showed that loop fragments are not only well representedin current structure databanks but shorter segments are possibly completelyexplored already (Du et al. 2003). It was reported that sequence segments up to tenresidues had a related (i.e. at least 50% identical segment) in PDB with a knownconformation, and despite the six fold increase in sequence databank size and thedoubling of PDB since 2002 there was not a single unique loop conformationentered in the PDB or sequence segment observed that shares less than 50%sequence identity to a PDB fragment, which indicates that newly sequenced proteinskeep recycling the same set of already known short structural segments. Allsequence segments up to 10–12 residues have at least one corresponding structuralsegment that shares at least 50% identity thus ensuring structural similarity, excepta very few notable exceptions mentioned above (Fernandez-Fuentes and Fiser2006). Consequently more recent efforts have tried to classify loop conformationsinto more general categories, thus extending the applicability of the database searchapproach for more cases (Fernandez-Fuentes et al. 2006a; Michalsky et al. 2003).A recent work described the advantage of using HMM sequence profiles in classifyingand predicting loops (Espadaler et al. 2004). An another recently publishedloop prediction approach first predicts conformation for a query loop sequence andthen structurally aligns the predicted structural fragments to a set of non-redundantloop structural templates. These sequence-template loop alignments are then quantitativelyevaluated with an artificial neural network model trained on a set of predictionswith known outcomes (Peng and Yang 2007).ArchPred, perhaps the most accurate database loop modelling approach, isbriefly described here (Fernandez-Fuentes et al. 2006a, b). ArchPred exploits ahierarchical and multidimensional database that has been set up to classify about300,000 loop fragments and loop flanking secondary structures. Besides the lengthof the loops and types of bracing secondary structures the database is organizedalong four internal coordinates, a distance and three types of angles characterizingthe geometry of stem regions (Oliva et al. 1997). Candidate fragments are selectedfrom this library by matching the length, the types of bracing secondary structuresof the query and satisfying the geometrical restraints of the stems and subsequentlyinserted in the query protein framework where their fit is assessed by the root meansquared deviation (RMSD) of stem regions and by the number of rigid body clasheswith the environment. In the final step, remaining candidate loops are ranked by aZ-score that combines information on sequence similarity and fit of predicted andobserved φ/ψ main chain dihedral angle propensities. Confidence Z-score cut-offswere determined for each loop length that identify those predicted fragments thatoutperform a competitive ab initio method. A web server implements the method,
3 Comparative Protein Structure Modelling 73regularly updates the fragment library and performs predictions. Predicted segmentsare returned or, optionally, these can be completed with side chain reconstructionand subsequently annealed in the environment of the query protein byconjugate gradient minimization.In summary, the recent reports about the more favourable coverage of loop conformationsin the PDB suggest that database approaches are now limited by theirability to recognize suitable fragments, and not by the lack of these segments(i.e. sampling), as earlier thought.Ab Initio Modelling of LoopsTo overcome the limitations of the database search methods, conformationalsearch methods were developed. There are many such methods, exploiting differentprotein representations, objective function terms, and optimization or enumerationalgorithms. The search strategies include the minimum perturbationmethod (Fine et al. 1986), molecular dynamics simulations (Bruccoleri andKarplus 1987), genetic algorithms (Ring and Cohen 1993), Monte Carlo andsimulated annealing (Abagyan and Totrov 1994; Collura et al. 1993), multiplecopysimultaneous search (Zheng et al. 1993), self-consistent field optimization(Koehl and Delarue 1995), and an enumeration based on the graph theory(Samudrala and Moult 1998). Loop prediction by optimization is applicable toboth simultaneous modelling of several loops and to those loops interacting withligands, neither of which is straightforward for the database search approaches,where fragments are collected from unrelated structures with differentenvironments.The MODLOOP module in MODELLER implements the optimization-basedapproach (Fiser et al. 2000; Fiser and Sali 2003b). Loop optimization inMODLOOP relies on conjugate gradients and molecular dynamics with simulatedannealing. The pseudo energy function is a sum of many terms, includingsome terms from the CHARMM-22 molecular mechanics force field (Brookset al. 1983) and spatial restraints based on distributions of distances (Melo andFeytmans 1997; Sippl 1990) and dihedral angles in known protein structures. Tosimulate comparative modelling problems, the loop modelling procedure wasoptimized and evaluated on a large number of loops of known structure both innative and in only approximately correct environments. The performance of theapproach later was further improved by using CHARMM molecular mechanicforcefield with Generalized Born (GB) solvation potential to rank final conformations(Fiser et al. 2002). Incorporation of solvation terms in the scoring functionwas a central theme in several other subsequent studies (Das and Meirovitch2003; de Bakker et al. 2003; DePristo et al. 2003; Forrest and Woolf 2003).Improved loop prediction accuracy resulted from the incorporation of an entropylike term to the scoring function, the “colony energy”, derived from geometricalcomparisons and clustering of sampled loop conformations (Fogolari andTosatto 2005; Xiang et al. 2002). The continuous improvement of scoring
Page 3:
Daniel John RigdenEditorFrom Protei
Page 8 and 9:
ContentsSection I Generating and In
Page 10 and 11:
Contentsxi4.2.1 Alpha-Helical Bundl
Page 12 and 13:
Contentsxiii7.4.2 Predicting Bindin
Page 14 and 15:
Contentsxv12.3 Accuracy and Added V
Page 16 and 17:
4 J. Lee et al.about 5.3 million pr
Page 18 and 19:
6 J. Lee et al.Table 1.1 A list of
Page 20 and 21:
8 J. Lee et al.2000; Sorin and Pand
Page 22 and 23:
10 J. Lee et al.Although most knowl
Page 24 and 25:
12 J. Lee et al.Fig. 1.3 Two exampl
Page 26 and 27:
14 J. Lee et al.rugged containing m
Page 28 and 29:
16 J. Lee et al.1.3.4 Mathematical
Page 30 and 31:
18 J. Lee et al.potentials, each re
Page 32 and 33: 20 J. Lee et al.viewpoint of struct
Page 34 and 35: 22 J. Lee et al.Hsieh MJ, Luo R (20
Page 36 and 37: 24 J. Lee et al.Sippl MJ (1990) Cal
Page 38 and 39: Chapter 2Fold RecognitionLawrence A
Page 40 and 41: 2 Fold Recognition 29It has long be
Page 42 and 43: 2 Fold Recognition 31Fig. 2.2 Graph
Page 44 and 45: 2 Fold Recognition 33distribute the
Page 46 and 47: 2 Fold Recognition 35Table 2.1 (a)
Page 48 and 49: 2 Fold Recognition 37dependent on t
Page 50 and 51: 2 Fold Recognition 39based on the r
Page 52 and 53: 2 Fold Recognition 41The idea of co
Page 54 and 55: 2 Fold Recognition 43query sequence
Page 56 and 57: 2 Fold Recognition 452.3.4 Consensu
Page 58 and 59: 2 Fold Recognition 472.4 Alignment
Page 60 and 61: 2 Fold Recognition 49statistical me
Page 62 and 63: 2 Fold Recognition 51methods (e.g.
Page 64 and 65: 2 Fold Recognition 53Berman HM, Wes
Page 66 and 67: 2 Fold Recognition 55Tress ML, Jone
Page 68 and 69: 58 A. Fiserwith more than 50% seque
Page 70 and 71: 60 A. FiserIn contrast to ab initio
Page 72 and 73: 62 A. FiserTable 3.1 (continued)SWI
Page 74 and 75: 64 A. Fiserbetween fold recognition
Page 76 and 77: 66 A. FiserFig. 3.1 Comparing accur
Page 78 and 79: 68 A. Fiserinto conserved core regi
Page 80 and 81: 70 A. Fiseralignments are built and
Page 84 and 85: 74 A. Fiserfunctions delivers impro
Page 86 and 87: 76 A. Fiserfrom efforts of genome s
Page 88 and 89: 78 A. FiserA rigorous statistical e
Page 90 and 91: 80 A. Fiser(Clore et al. 1993). Cha
Page 92 and 93: 82 A. FiserImproved and new methods
Page 94 and 95: 84 A. FiserClaessens M, Van Cutsem
Page 96 and 97: 86 A. FiserHavel TF, Snow ME (1991)
Page 98 and 99: 88 A. FiserPetrey D, Xiang Z, Tang
Page 100 and 101: 90 A. Fiservan Vlijmen HW, Karplus
Page 102 and 103: 92 T. Nugent and D.T. Jones4.2 Stru
Page 104 and 105: 94 T. Nugent and D.T. JonesFig. 4.2
Page 106 and 107: 96 T. Nugent and D.T. JonesTable 4.
Page 108 and 109: 98 T. Nugent and D.T. Jones2000), d
Page 110 and 111: 100 T. Nugent and D.T. JonesTable 4
Page 112 and 113: 102 T. Nugent and D.T. JonesFig. 4.
Page 114 and 115: 104 T. Nugent and D.T. JonesFig. 4.
Page 116 and 117: 106 T. Nugent and D.T. Jonessubdivi
Page 118 and 119: 108 T. Nugent and D.T. Jonescomplex
Page 120 and 121: 110 T. Nugent and D.T. JonesMartell
Page 122 and 123: Chapter 5Bioinformatics Approaches
Page 124 and 125: 5 Structure and Function of Intrins
Page 132 and 133:
5 Structure and Function of Intrins
Page 134 and 135:
Page 136 and 137:
Page 138 and 139:
Page 140 and 141:
Page 142 and 143:
Page 144 and 145:
Page 146 and 147:
Page 148 and 149:
Page 150 and 151:
Chapter 6Function Diversity Within
Page 152 and 153:
6 Function Diversity Within Folds a
Page 154 and 155:
Page 156 and 157:
Page 158 and 159:
Page 160 and 161:
Page 162 and 163:
Page 164 and 165:
Page 166 and 167:
Page 168 and 169:
Page 170 and 171:
Page 172 and 173:
Page 174 and 175:
Chapter 7Predicting Protein Functio
Page 176 and 177:
7 Predicting Protein Function from
Page 178 and 179:
Page 180 and 181:
Page 182 and 183:
Page 184 and 185:
Page 186 and 187:
Page 188 and 189:
Page 190 and 191:
Table 7.1 Online resources and tool
Page 192 and 193:
Page 194 and 195:
Chapter 83D MotifsElaine C. Meng, B
Page 196 and 197:
8 3D Motifs 189clustering similar s
Page 198 and 199:
8 3D Motifs 191To improve the signa
Page 200 and 201:
8 3D Motifs 193structures together.
Page 202 and 203:
8 3D Motifs 195Table 8.1 (continued
Page 204 and 205:
8 3D Motifs 1978.3.1 User-Defined M
Page 206 and 207:
8 3D Motifs 199Fig. 8.3 The FFF mot
Page 208 and 209:
8 3D Motifs 2018.3.2 Motif Discover
Page 210 and 211:
8 3D Motifs 203In addition to SITE
Page 212 and 213:
8 3D Motifs 205folds; they shared a
Page 214 and 215:
8 3D Motifs 207from a training set
Page 216 and 217:
8 3D Motifs 209Table 8.3 Web server
Page 218 and 219:
8 3D Motifs 211its greater overall
Page 220 and 221:
8 3D Motifs 213Ideally, a 3D motif
Page 222 and 223:
8 3D Motifs 215Ivanisenko VA, Pintu
Page 224 and 225:
Chapter 9Protein Dynamics: From Str
Page 226 and 227:
9 Protein Dynamics: From Structure
Page 228 and 229:
Page 230 and 231:
Page 232 and 233:
Page 234 and 235:
Page 236 and 237:
Page 238 and 239:
Page 240 and 241:
Page 242 and 243:
Page 244 and 245:
Page 246 and 247:
Page 248 and 249:
Page 250 and 251:
Page 252 and 253:
Page 254 and 255:
Page 256 and 257:
Page 258 and 259:
252 R.A. LaskowskiConsequently, man
Page 260 and 261:
254 R.A. Laskowskiucla.edu, and Pro
Page 262 and 263:
256 R.A. LaskowskiFig. 10.2 Gene On
Page 264 and 265:
258 R.A. Laskowski10.2.5 Protein In
Page 266 and 267:
260 R.A. LaskowskiFig. 10.4 Schemat
Page 268 and 269:
262 R.A. Laskowskiaccessibility and
Page 270 and 271:
264 R.A. LaskowskiFig. 10.6 A ligan
Page 272 and 273:
266 R.A. LaskowskiGly97Gly99Tyr98Ty
Page 274 and 275:
268 R.A. LaskowskiFig. 10.8 Example
Page 276 and 277:
270 R.A. LaskowskiReferencesAltschu
Page 278 and 279:
272 R.A. LaskowskiXenarios I, Salwi
Page 280 and 281:
274 J.D. Watson and J.M. ThorntonTh
Page 282 and 283:
276 J.D. Watson and J.M. ThorntonTa
Page 284 and 285:
278 J.D. Watson and J.M. Thorntonva
Page 286 and 287:
280 J.D. Watson and J.M. Thorntonfo
Page 288 and 289:
282 J.D. Watson and J.M. Thorntonin
Page 290 and 291:
284 J.D. Watson and J.M. Thorntonan
Page 292 and 293:
286 J.D. Watson and J.M. ThorntonFi
Page 294 and 295:
288 J.D. Watson and J.M. ThorntonPD
Page 296 and 297:
290 J.D. Watson and J.M. ThorntonKi
Page 298 and 299:
Chapter 12Prediction of Protein Fun
Page 300 and 301:
12 Prediction of Protein Function f
Page 302 and 303:
Page 304 and 305:
Page 306 and 307:
Page 308 and 309:
Page 310 and 311:
Page 312 and 313:
Page 314 and 315:
Page 316 and 317:
Page 318 and 319:
Page 320 and 321:
Page 322 and 323:
Page 324 and 325:
320 IndexConsensus templates, 205Co
Page 326 and 327:
322 IndexLigand-binding templates,
Page 328:
324 IndexStructure-function linkage
show all

From Protein Structure to Function with Bioinformatics.pdf

Create successful ePaper yourself

Delete template?

Save as template?