29.10.2014 Views

advances-in-protein-chemistry

advances-in-protein-chemistry

advances-in-protein-chemistry

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

evaluation [78], Whatcheck [79] and Procheck [80], which produces reports so that the user can estimate the quality of<br />

prote<strong>in</strong> models and template structures.<br />

The SWISS-MODEL Repository is also available for the annotation of three-dimensional comparative prote<strong>in</strong> structure<br />

models generated by homology-model<strong>in</strong>g pipel<strong>in</strong>e SWISS-MODEL [51].<br />

SWISS-MODEL is developed by the Computational Structural Biology Group at the Swiss Institute of Bio<strong>in</strong>formatics<br />

(SIB) and the Biozentrum of the University of Basel. SWISS-MODEL was <strong>in</strong>itiated <strong>in</strong> 1993 by Manuel Peitsch, and further<br />

developed at GWER - Glaxo Wellcome Experimental Research <strong>in</strong> Geneva and the SIB - Swiss Institute of Bio<strong>in</strong>formatics by<br />

Manuel Peitsch, Nicolas Guex and Torsten Schwede. S<strong>in</strong>ce 2001, SWISS-MODEL is be<strong>in</strong>g developed by Torsten Schwede’s<br />

Structural Bio<strong>in</strong>formatics Group at the Biozentrum (University of Basel) and SIB - Swiss Institute of Bio<strong>in</strong>formatics.<br />

3.21.3 3D-JIGSAW<br />

3D-JIGSAW is a web-based server which generates 3D models for prote<strong>in</strong>s based on homologues of known structure.<br />

The program can either be run <strong>in</strong> completely automatic mode by means of a web server or the <strong>in</strong>dividual modules of<br />

the program may be executed separately and the <strong>in</strong>termediate files saved which can be modified, if necessary, before the<br />

next module <strong>in</strong> the series is run [75]. In automatic mode, the program looks for homologous templates <strong>in</strong> its sequence<br />

databases and splits the query sequence <strong>in</strong>to doma<strong>in</strong>s and the best covered doma<strong>in</strong> is modeled us<strong>in</strong>g a maximum of two<br />

good templates, if found. The process may take up to an hour on a system. An e-mail with the alignment between query<br />

and template and a PDB formatted set of coord<strong>in</strong>ates is sent to user, while <strong>in</strong> an <strong>in</strong>teractive mode an e-mail is sent back<br />

to user with a l<strong>in</strong>k to a graphical display of the doma<strong>in</strong> arrangement and useful <strong>in</strong>formation extracted from the PFAM<br />

database. From this l<strong>in</strong>k, user can select the doma<strong>in</strong>s needed to be modeled and can select the templates and correct the<br />

alignments before submitt<strong>in</strong>g a model<strong>in</strong>g job. Templates are ranked accord<strong>in</strong>g to the coverage of the query, their sequence<br />

identity and their crystallographic resolution. Information from each template is easily accessed, <strong>in</strong>clud<strong>in</strong>g its alignment to<br />

the query sequence. In order to remove the steric clashes, the models are subjected to 100 steps of steepest descents energy<br />

m<strong>in</strong>imization (unrestra<strong>in</strong>ed) by us<strong>in</strong>g the program CHARMM [81]. Additionally, error approximations were made by<br />

measur<strong>in</strong>g the range of equivalent atomic displacements, as calculated from the superposition of all significant homologues<br />

[82]. A new version of 3D-JIGSAW (version 3.0) is also available where templates are identified us<strong>in</strong>g HMM [83] and the<br />

returned alignments are used to generate the models.<br />

3.21.4 Robetta<br />

Robetta web server offers automated structure prediction and analysis tools to deduce prote<strong>in</strong> structural <strong>in</strong>formation<br />

from genomic data. The server utilizes the completely automated structure prediction process that constructs a model<br />

for a whole prote<strong>in</strong> sequence whether the sequence homology to prote<strong>in</strong>s of known structure is available or not. Robetta<br />

breaks down the <strong>in</strong>put sequences <strong>in</strong>to doma<strong>in</strong>s and generates models for doma<strong>in</strong>s with sequence homology to prote<strong>in</strong>s of<br />

known structure us<strong>in</strong>g comparative model<strong>in</strong>g where as models for doma<strong>in</strong>s lack<strong>in</strong>g such homology are constructed us<strong>in</strong>g<br />

the Rosetta de novo structure prediction method. Doma<strong>in</strong> predictions and molecular coord<strong>in</strong>ates of models across the fulllength<br />

query are given as results [84].<br />

The server can also exploit nuclear magnetic resonance (NMR) constra<strong>in</strong>ts data supplied by the user to resolve prote<strong>in</strong><br />

structures us<strong>in</strong>g the RosettaNMR [85,86] protocol. These tools can be used <strong>in</strong> comb<strong>in</strong>ation with exist<strong>in</strong>g structural genomics<br />

<strong>in</strong>itiatives to aid accelerate structure determ<strong>in</strong>ation and expand structural <strong>in</strong>sight for targeted open read<strong>in</strong>g frames (ORFs).<br />

Robetta uses a somewhat modified edition of the de novo structure prediction method [87] so that the queries can be run<br />

with<strong>in</strong> reasonable timescales. Similar to the orig<strong>in</strong>al protocol, Robetta generates three- and n<strong>in</strong>e-residue fragment libraries<br />

that correspond to local conformations seen <strong>in</strong> the PDB, and then assembles models by fragment <strong>in</strong>sertion us<strong>in</strong>g a scor<strong>in</strong>g<br />

function that favors prote<strong>in</strong>-like characteristics. About 10000 decoys are generated for the orig<strong>in</strong>al query where as 5000<br />

decoys for up to two sequence homologs are generated by Robetta. Subsequently, 2000 query decoys and 1000 decoys of<br />

each homolog are selected on the basis of score, and on whether they surpass filters that remove decoys hav<strong>in</strong>g too many<br />

local contacts or unlikely strand topologies. The selected decoys are then clustered based on Ca root-mean-square deviation<br />

(RMSD) above the entire ungapped positions. The top 9 cluster centers are selected as the top ranked models, and the f<strong>in</strong>est<br />

scor<strong>in</strong>g model that passed the filters is chosen as the 10th model. Searches with these f<strong>in</strong>al models are then carried out<br />

aga<strong>in</strong>st a representative set of PDB cha<strong>in</strong>s to discover comparable structures us<strong>in</strong>g Mammoth [88] to recognize potential<br />

similarities to prote<strong>in</strong>s of known structure.<br />

In view of the fact that it is difficult to do de novo structure predictions with high accuracy, two moderately liberal<br />

def<strong>in</strong>itions of correct predictions were def<strong>in</strong>ed. A doma<strong>in</strong> is considered to be accurately modeled if at least 1 of the 10<br />

models has a Mammoth alignment of 50 or more residues with an RMSD of 4Å or less to the native structure, or a Mammoth<br />

Z-score of 6 or better to the native structure. Robetta produces correct predictions for more than half of the doma<strong>in</strong>s on the<br />

basis of these def<strong>in</strong>itions, especially with doma<strong>in</strong>s that consist of either all-alpha or alpha-beta secondary structure.<br />

3.21.5 ESyPred3D<br />

ESyPred3D is an automated homology model<strong>in</strong>g web server. It implements four steps of homology model<strong>in</strong>g approach<br />

which <strong>in</strong>cludes database search<strong>in</strong>g to identify structure homolog, target template alignment, model build<strong>in</strong>g & optimization<br />

and model evaluation. ESyPred3D is available for use at http://www.fundp.ac.be/urbm/bio<strong>in</strong>fo/esypred/. The data and<br />

parameters required to submit the simplest job are the sequence, email id and description of job. In the advance parameter,<br />

PDB ID with cha<strong>in</strong> name can be given to use this as template or PDB ID can be submitted to discard it as template. User has<br />

OMICS Group eBooks<br />

011

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!