12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

58 A. Fiser<strong>with</strong> more than 50% sequence identity, transfer of annotation between homologues leads<strong>to</strong> an erroneous attribution <strong>with</strong> a <strong>to</strong>tally dissimilar function in 6% of cases.<strong>Function</strong>al characterization of a protein is often facilitated by its three-dimensional(3D) structure. The insight that one may gain from a 3D model ranges fromsuch low level functional descriptions as confirming the fold (G. Wu et al. 2000)and inferring a general functional role, <strong>to</strong> such high resolution descriptions asunderstanding ligand specificities and designing inhibi<strong>to</strong>rs in the context of structurebased drug discovery (Becker et al. 2006; Evers et al. 2003).3.1.2 Sequences, <strong>Structure</strong>s, Structural GenomicsGenome scale sequencing projects have already produced around six millionunique sequences <strong>to</strong> date (Apweiler et al. 2004; C. H. Wu et al. 2006), a numberthat would double if metagenomic data from Craig Venter’s Global Ocean Surveywas added <strong>to</strong> public databases (Rusch et al. 2007; Venter et al. 2004; Yooseph et al.2007). Meanwhile only ∼50,000 of these proteins have their three-dimensionalstructures solved experimentally using X-ray crystallography or Nuclear MagneticResonance (NMR) spectroscopy (Berman et al. 2007). Because of the inherentlytime consuming and complicated nature of structure determination techniques, thefraction of experimentally known 3D models is expected <strong>to</strong> further shrink from thecurrent level of less than 1%. Computational approaches need <strong>to</strong> be employed <strong>to</strong>bridge the gap between the number of known sequences and that of 3D models.Structural genomics projects were launched worldwide around the year 2000.One key aim is the experimental solution of the three dimensional structures of acarefully selected few thousand target sequences of structurally uncharacterizedproteins. These newly solved structures could be used as templates for computationalmodelling of about 100 times more proteins whose sequences are related(Burley et al. 1999). These worldwide structural genomics efforts are becoming thedominant source of experimentally solved protein structures. In recent years 75%of new folds deposited <strong>to</strong> the PDB emerged from these efforts (Burley et al. 2008).Meanwhile these efforts further underline the importance of theoretical approaches<strong>to</strong> structure modelling, since more than 99% of all three dimensional models willbe obtained computationally (Manjasetty et al. 2007).3.1.3 Approaches <strong>to</strong> <strong>Protein</strong> <strong>Structure</strong> PredictionThe study of principles that dictate the three-dimensional structure of natural proteinscan be approached either through the laws of physics or the theory of evolution.Each of these approaches provides foundation for a class of protein structureprediction methods (Fiser et al. 2002).The first approach, ab initio or template-free modelling methods, discussed inChapter 1, predicts the structure from sequence alone (Bonneau and Baker 2001;

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!