12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

52 L.A. KelleyIt is unclear <strong>to</strong> what extent improvements in structure prediction have been theresult of increases in database size versus algorithm improvement. Although thesequence database has been growing in size exponentially, its information contenthas not. The vast majority of sequences added <strong>to</strong> the sequence database each yearare highly similar <strong>to</strong> those already contained <strong>with</strong>in the database. Recent work(Chubb, Kelley and Sternberg, manuscript in preparation) illustrates that despite agrowing sequence database, homology detection <strong>with</strong> standard <strong>to</strong>ols (such as Psi-BLAST) is plateauing. It is thus hard <strong>to</strong> imagine further significant increases inhomology detection based on mining the sequence database alone. It is also unclearhow much the recent improvements in ab initio structure prediction may be due <strong>to</strong>the increased structural database containing ever better fragments of structure foruse in fragment assembly methods (Zhang and Skolnick 2005).The database of sequences and structure will continue <strong>to</strong> increase. Even if algorithmicdevelopment on structure prediction were <strong>to</strong> cease <strong>to</strong>day, structure prediction accuracywould continue <strong>to</strong> improve. Ignoring protein design issues, structure prediction isan exercise in practical reduction of time and cost in determining protein structure.The desire <strong>to</strong> ‘solve’ the protein folding problem is alive and well as one of theholy grails of molecular biology. Yet even in the absence of such a ‘solution’ itseems likely that we will have usefully accurate models of most, if not all proteinsfound in nature <strong>with</strong>in a reasonable time. Whether that time is 5, 10 or 50 years willbe down <strong>to</strong> the ingenuity and diligent work of both experimentalists solving structuresand genomes, and modellers mining the information therein for all it is worth.This is no longer a question of ‘if’ but ‘when’.<strong>Protein</strong> design however remains a very different and daunting task, founded ona deep understanding of protein folding. To understand protein folding is <strong>to</strong> understandhow the ‘software’ of DNA becomes the ‘hardware’ of functional proteins.It is <strong>to</strong> understand, at a fundamental level, the nature of living things. However,there may be no elegant solution <strong>to</strong> the protein folding problem. Nature does notnecessarily find an elegant solution; simply one that works. Reluctantly, we mayhave <strong>to</strong> be satisfied <strong>with</strong> a complex predictive framework. Nevertheless the hope fora simple, computationally tractable and hither<strong>to</strong> undiscovered explanation for proteinfolding remains strong.Acknowledgements I would like <strong>to</strong> thank Dr. Benjamin Jefferys for his extensive help <strong>with</strong> theillustrations in this chapter.ReferencesAltschul SF, Madden TL, Schäffer AA, et al. (1997) Gapped BLAST and PSI-BLAST: a newgeneration of protein database search programs. Nucleic Acids Res. 25:3389–3402Bateman A and Finn RD (2007) SCOOP: a simple method for identification of novel proteinsuperfamily relationships. <strong>Bioinformatics</strong> 23:809–814Bennett-Lovsey RM, Herbert AD, Sternberg MJ, et al. (2008) Exploring the extremes of sequence/structure space <strong>with</strong> ensemble fold recognition in the program Phyre. <strong>Protein</strong>s. 70:611–625

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!