12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

300 I.A. Cymerman et al.larity cases contain less information in the template about the size and physicochemicalproperties of particular residues in the target. However, not all structure-derivedproperties provide additional information <strong>with</strong> respect <strong>to</strong> the template. For SDPs thatdepend mostly on position of residues, such as exposure state, neighbourhood of buriedresidues and number of surface pockets, models do not provide added-value. It isprobably caused by the fact that buried residues are more conserved than exposedresidues, comprising protein cores that are responsible for protein integrity.For other SDPs, such as neighbourhood of exposed residues and <strong>to</strong>tal accessiblesurface area (ASA), models show some added-value. This is very important as residuesaccessible <strong>to</strong> the solvent are responsible for interactions <strong>with</strong> other molecules,thus determining the biological function of the protein.Finally, for properties that strongly depend on the physicochemical characteristicsof the amino acids in the sequence, such as composition of pockets andelectrostatic potential, models show large added-value. The identification ofcharged regions is of large value as they may represent binding or active sites (seeChapter 7).To summarize: generally the studies performed by Chakravarty et al. demonstratethat, <strong>with</strong> the exception of the detection of pockets, most model-derivedstructural properties exhibit some level of added-value. The more a given propertydepends on the sequence of the protein the more useful a model will be in estimatingthe value of that property. Encouragingly, depending on the feature, 25–40%sequence identity between target and template was sufficient <strong>to</strong> produce a SDPestimate of comparable accuracy <strong>to</strong> that available from an NMR structure.12.3.1 ImplementationThe knowledge about the added-value of particular structural and surface propertiesof models raises the question whether they can be also useful for the functionprediction. In 1998 Fetrow and Skolnick proposed a multi-step procedure thatenables identification of protein functional sites in low-<strong>to</strong>-moderate resolutionmodels (Fetrow and Skolnick 1998). Based on geometry, residue identity, distancesbetween alpha carbons and conformation, the active site residues become athree dimensional descrip<strong>to</strong>r termed Fuzzy <strong>Function</strong>al Form (FFF). AfterwardsFFFs are used <strong>to</strong> screen the set of three dimensional models <strong>to</strong> identify <strong>with</strong>inthem those containing similar structural motifs. The usefulness of the method wasproved by the identification of the novel members of the disulphide glutaredoxin/thioredoxin protein family <strong>with</strong>in in the yeast (Fetrow and Skolnick 1998) and E.coli genomes (Fetrow et al. 1998), whose function could not be identified bysequence comparison methods. The great advantage of FFF and related approachesis that the method distinguishes protein pairs <strong>with</strong> similar active sites fromproteins pairs that may have similar folds, but not necessarily similar active sites.The FFF technology was further developed <strong>to</strong> the method called active siteprofiling (Cammer et al. 2003) and was successfully combined <strong>with</strong> experimental

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!