13.07.2015 Views

computer modeling in molecular biology.pdf

computer modeling in molecular biology.pdf

computer modeling in molecular biology.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

30 Tim J.l? Hubbard and Arthur M. Leskwidespread. Such methods make use of multiple sequence <strong>in</strong>formation, look<strong>in</strong>g forconsistency between the predictions for different sequences. It should be noted thateven given a correct secondary structure assignment, it is very difficult to determ<strong>in</strong>ehow the units fit together <strong>in</strong> three dimensions. [77]. A table of aligned sequences maywell conta<strong>in</strong> derivable <strong>in</strong>formation about the 3-D structure of a prote<strong>in</strong> but attemptsto recover it have so far met with no more than sporadic success [12]. However, usefuldeductions about the most likely folded structure can be made <strong>in</strong> a systematic wayfrom a comb<strong>in</strong>ation of analysis of SSP results and the conservation patternsobserved <strong>in</strong> a multiple sequence alignment. Successful predictions us<strong>in</strong>g such an approachhave been made for the annex<strong>in</strong> [78] and Src homology 2 (SH2) prote<strong>in</strong>families [79].2.4.2 A Lone Sequence or a Designed Sequence:no Multiple Sequence, no Known RelativesThis situation is the most unfavourable for model build<strong>in</strong>g; as one has no way ofapply<strong>in</strong>g known sequence or structure <strong>in</strong>formation. In effect the problem can onlybe handled by a priori methods, Even secondary structure prediction is <strong>in</strong>accuratefor s<strong>in</strong>gle sequences and therefore the likelihood of build<strong>in</strong>g a correct three-dimensionalmodel is small.If there is any suspicion (perhaps on functional grounds) that a natural sequencehas a certa<strong>in</strong> fold, or <strong>in</strong> the case of a designed sequence, built to fold <strong>in</strong> a particularway, the situation is slightly better s<strong>in</strong>ce it is possible to test the likelihood that a sequencecan match a particular fold. Methods for do<strong>in</strong>g this <strong>in</strong>clude check<strong>in</strong>g polarity1801; pack<strong>in</strong>g quality and residue-residue contact frequencies [81] ; various freeenergy functions <strong>in</strong>corporat<strong>in</strong>g solvation effects [82, 831, hydration and heat stabilityeffects and more recently us<strong>in</strong>g thread<strong>in</strong>g techniques to establish if the sequence iscompatible with the fold [MI.The disadvantages of these methods are that (1) most provide only an assessmentof the structure as a whole rather than of local regions (models are frequently onlypartially right e. g. [86]) (2) even at this level they are <strong>in</strong>accurate, i. e. some experimentallydeterm<strong>in</strong>ed structures are classified as <strong>in</strong>correct whereas some misfolded modelsare classed as correct <strong>in</strong> bl<strong>in</strong>d tests and (3) that the results are essentially dependenton the quality of the model rather than the correctness of its fold. Moreover, thesetests only look at the f<strong>in</strong>al state and do not assess if the sequence is compatible withany pathway to that state. For natural sequences it can be assumed that fold<strong>in</strong>g toa compact state can be achieved but this is more likely to fail to be the case fordesigned sequences. Current experimental [87] and theoretical work [88] on thefold<strong>in</strong>g pathways of prote<strong>in</strong>s suggest that there are clear fold<strong>in</strong>g <strong>in</strong>itiation sites

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!