12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

10 Integrated Servers for <strong>Structure</strong>-Informed <strong>Function</strong> Prediction 253Plus there are various spanners that gum up the works. Firstly, it is often difficult<strong>to</strong> solve the whole intact protein. In these cases the best one can get is a structuralmodel for part of the protein – say, just a single domain. On its own this domainmay say little about the protein’s function. Secondly, even if the whole protein issolved, it may be just one component of a multi-protein complex. Again, the structuregives only part of the s<strong>to</strong>ry. More dastardly still are the so-called moonlightingproteins which can actually have more than one function, depending on their context:cellular location, environment, and so on (Jeffery 1999). And some proteinscan alter their function according <strong>to</strong> which alternatively spliced variant is expressedat any given time (Stamm et al. 2005).Another problem <strong>with</strong> function prediction is the difficulty of assessing the successor failure of a given prediction method and, indeed, even defining what ismeant by function. <strong>Function</strong> can be described at many levels, ranging from biochemicalfunction through biological processes and pathways, all the way up <strong>to</strong>the organ or organism level (Shrager 2003). Consequently, a given protein may beannotated at several different levels of functional specificity: for example, ubiquitinlikedomain, signalling protein, predicted serine hydrolase, probable eukaryoticD-amino acid tRNA deacetylase, and so on. Thus it is difficult <strong>to</strong> judge theaccuracy of any such assignment, especially if the assignment is one of the morevague ones.A common strategy for assessing function prediction methods is <strong>to</strong> use the GeneOn<strong>to</strong>logy (GO) (The Gene On<strong>to</strong>logy Consortium 2000; Camon et al. 2004). This isan open source scheme for functional annotation of protein sequences. It is amachine-readable on<strong>to</strong>logy based on a controlled vocabulary of functional descrip<strong>to</strong>rsand many function prediction methods couch their results in terms of GOcodes. Although not strictly hierarchical, the GO functional descrip<strong>to</strong>rs range fromthe truly unspecific (e.g. enzyme) down <strong>to</strong> the highly precise (e.g. 1-pyrroline-4-hydroxy-2-carboxylate deaminase).10.1.2 <strong>Structure</strong>-<strong>Function</strong> Prediction MethodsAs the previous chapters show, there are very many different methods for predictingfunction from structure. Several reviews have described them and considered theirusefulness (Kim et al. 2003; Watson et al. 2005; Rigden 2006). None of the methodsis perfect and none can hope <strong>to</strong> be successful in all situations. For example,some methods are only suitable for enzymes – and so cannot help at all if the proteinin question is not an enzyme. Other methods rely strongly on some match –whether of the fold, or a motif, or a binding site, etc. – <strong>to</strong> a protein of knownstructure. So if no match can be found, or the match is merely <strong>to</strong> another hypotheticalprotein, such a method effectively returns a blank.Consequently, a sensible approach is <strong>to</strong> throw a large number of these predictivemethods at the protein structure and see what drops out. The two servers describedin this chapter do just that. They are ProKnow from UCLA at http://proknow.mbi.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!