13.07.2015 Views

computer modeling in molecular biology.pdf

computer modeling in molecular biology.pdf

computer modeling in molecular biology.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

2 Modell<strong>in</strong>n Prote<strong>in</strong> Structures 11surmountable problems with this approach that have deprived it of the success itshould <strong>in</strong> pr<strong>in</strong>ciple someday achieve. The major problems are limitations on <strong>computer</strong>time (currently time <strong>in</strong>tervals of at most nanoseconds can be simulated withtoday’s fastest <strong>computer</strong>s, whereas prote<strong>in</strong> fold<strong>in</strong>g is thought to occur over seconds);and the problem of <strong>in</strong>complete and <strong>in</strong>exact representation of the thermodynamic <strong>in</strong>teractions- particularly the problem of appropriately represent<strong>in</strong>g the prote<strong>in</strong>water<strong>in</strong>teraction (prote<strong>in</strong>s fold <strong>in</strong> an aqueous environment which must berepresented explicitly and accurately) - and other uncerta<strong>in</strong>ties about the <strong>in</strong>teractionpotentials used between atoms <strong>in</strong> the system, particularly the electrostatic terms.MD can however be usefully applied to modell<strong>in</strong>g problems where there are sufficientconformational restra<strong>in</strong>ts,All other a priori prediction methods are less physically realistic than the aboveand <strong>in</strong>volve either unproven assumptions or unlikely generalisations (or both). Nomethod that might be expected to become generally successful is <strong>in</strong> sight.As a priori prediction is unsuccessful, we are fortunate that - unlike a prote<strong>in</strong>cha<strong>in</strong> fold<strong>in</strong>g <strong>in</strong> vivo - modellers can make use of knowledge from all known prote<strong>in</strong>sequences and structures when predict<strong>in</strong>g a prote<strong>in</strong> fold. It is the extraction andapplication of this <strong>in</strong>formation that is the basis for all successful prediction methods.Central to such methods is the idea of homology.2.1.2 The Idea of Homology Between Prote<strong>in</strong>sTwo prote<strong>in</strong>s are homologous if they are related by natural evolutionary processes.Many homologous sequences are sufficiently closely related that their am<strong>in</strong>o acid sequencescan be aligned so that the number of similar or identical pairs of am<strong>in</strong>oacids at aligned positions is greater than expected by chance. An alignment may conta<strong>in</strong>gaps because as a prote<strong>in</strong> sequence evolves both mutation and <strong>in</strong>sertion/deletionevents can occur <strong>in</strong> the encod<strong>in</strong>g DNA.Form<strong>in</strong>g an accurate alignment of the am<strong>in</strong>o acid sequences is absolutely essentialfor useful model build<strong>in</strong>g. When the divergence of the sequences has left no fewerthan about 40% of the residues identical <strong>in</strong> an optimal alignment, it is likely thatthe standard sequence-alignment methods will provide a correct alignment [4]. Indeed,such an alignment is prima facie evidence for homology, which of course canonly rarely be detected directly. (The exceptions are cases of obvious gene duplicationand divergence. A classic example of this is the two doma<strong>in</strong>s of rhodanese [5].) Whensequences have diverged substantially farther than this 40% threshold, it may be impossibleto determ<strong>in</strong>e the correct alignment from pairs of sequences only, but it maybe possible to determ<strong>in</strong>e a correct alignment from a comparison of the structures,provided of course that they are available. Also, multiple sequence alignments aremuch more <strong>in</strong>formative than alignments of only a s<strong>in</strong>gle pair of sequences. In ex-

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!