29.10.2014 Views

advances-in-protein-chemistry

advances-in-protein-chemistry

advances-in-protein-chemistry

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

identifier, s<strong>in</strong>ce a s<strong>in</strong>gle macromolecule can have a numerous motions and the same vital motion can be shared between<br />

diverse macromolecules. Also, each entry has l<strong>in</strong>ks to graphics and movies describ<strong>in</strong>g the motion, frequently portray<strong>in</strong>g a<br />

possible <strong>in</strong>terpolated pathway [31].<br />

2.2.1.7 Orientations of Prote<strong>in</strong>s <strong>in</strong> Membranes (OPM)<br />

OPM database is a compilation of transmembrane, monotopic and peripheral prote<strong>in</strong>s derived from PDB whose<br />

spatial arrangements <strong>in</strong> the lipid bilayer have been theoretically calculated and compared with the experimental data. The<br />

OPM database allows various different types of analysis, for <strong>in</strong>stance, sort<strong>in</strong>g and search<strong>in</strong>g of membrane prote<strong>in</strong>s on the<br />

basis of their structural classification, species, dest<strong>in</strong>ation membrane, numbers of transmembrane segments and subunits,<br />

numbers of secondary structures and the calculated hydrophobic thickness or tilt angle concern<strong>in</strong>g the bilayer normal.<br />

OPM can be browsed either by search<strong>in</strong>g prote<strong>in</strong>s by their name or PDB ID or by sort<strong>in</strong>g of prote<strong>in</strong>s <strong>in</strong> tables for various<br />

specific categories like type, class, superfamily, family, dest<strong>in</strong>ation membrane or biological source. An <strong>in</strong>dividual web page<br />

is created for each membrane prote<strong>in</strong> complex, and coord<strong>in</strong>ate files of all prote<strong>in</strong>s with calculated membrane boundary<br />

planes are accessible for download<strong>in</strong>g <strong>in</strong>dividually for each prote<strong>in</strong> or as a whole dataset. Currently, OPM has about 2172<br />

prote<strong>in</strong>s grouped <strong>in</strong>to 393 superfamilies and 667 families represent<strong>in</strong>g 538 species [32].<br />

2.2.1.8 PROCARB<br />

PROCARB is an open-access database which consists of three separately function<strong>in</strong>g components, i.e., (i) Core<br />

PROCARB module, <strong>in</strong>cludes 3D structures of prote<strong>in</strong>-carbohydrate complexes taken from PDB, (ii) Homology Models<br />

module, consist<strong>in</strong>g of manually constructed 3D models of N-l<strong>in</strong>ked and O-l<strong>in</strong>ked glycoprote<strong>in</strong>s by comparative model<strong>in</strong>g,<br />

and (iii) CBS-Pred prediction module, consists of web servers to predict carbohydrate-b<strong>in</strong>d<strong>in</strong>g sites us<strong>in</strong>g s<strong>in</strong>gle sequence<br />

or server-generated PSSM. In PROCARB, numerous pre-computed structural and functional properties of complexes<br />

are also <strong>in</strong>cluded for rapid analysis. Particularly, <strong>in</strong>formation about function, secondary structure, solvent accessibility,<br />

hydrogen bonds and literature reference is <strong>in</strong>cluded. Additionally, each prote<strong>in</strong> <strong>in</strong> the database is mapped to Uniprot, Pfam,<br />

PDB, and so forth. Currently the PROCARB consists of 604 experimentally verified prote<strong>in</strong> carbohydrate complexes and 26<br />

N-l<strong>in</strong>ked and 20 O-l<strong>in</strong>ked models <strong>in</strong> which at least one experimentally verified glycosylation site was modeled [33]. Figure<br />

1 shows a representative N-l<strong>in</strong>ked homology model of Human Lysosomal alpha-glucosidase with three experimentally<br />

verified glycosylation sites (Uniprot ID = P10253).<br />

ASN390<br />

ASN470<br />

ASN652<br />

Figure 1: 3D model of Human Lysosomal alpha-glucosidase (Uniprot ID = P10253) with three experimentally verified N-l<strong>in</strong>ked glycosylation<br />

sites ASN390 (green), ASN470 (red) and ASN652 (orange).<br />

2.2.2 Prote<strong>in</strong> structure model databases<br />

2.2.2.1 Prote<strong>in</strong> Model Database (PMDB)<br />

PMDB is a relational database of manually generated prote<strong>in</strong> models deposited by users and atta<strong>in</strong>ed with different<br />

structure prediction methods. The database provides easy access to models that have been published <strong>in</strong> the scientific<br />

literature, simultaneously with validat<strong>in</strong>g experimental data. Most of the models <strong>in</strong> PMDB are the predictions which have<br />

been submitted to the Critical Assessment of Techniques for Structure Prediction (CASP) experiment, as well as models<br />

generated by PMDB group, and the models uploaded based on published alignments [2]. For each prote<strong>in</strong> target, one or<br />

more models could be available or several models for different regions of the same target prote<strong>in</strong>. The database provides<br />

some <strong>in</strong>formation for each target and <strong>in</strong>cludes the prote<strong>in</strong> name, sequence and length, organism and, whenever possible,<br />

l<strong>in</strong>ks to the SwissProt sequence database. As soon as the structure of a target is determ<strong>in</strong>ed, the PMDB database entry is also<br />

l<strong>in</strong>ked to the experimental structure <strong>in</strong> the PDB.<br />

2.2.2.2 Molecular Model<strong>in</strong>g Database (MMDB)<br />

MMDB offers simple access to the richness of 3D structure data and its huge potential for functional annotation [34].<br />

MMDB reflects the contents of the PDB and is strongly <strong>in</strong>tegrated with NCBI’s Entrez search and retrieval system. In<br />

MMDB, prote<strong>in</strong> 3D structure data is connected with sequence data, sequence classification resources and PubChem,<br />

provid<strong>in</strong>g easy access to 3D structure data for structural biologists, as well as for molecular biologists and chemists [35]. An<br />

OMICS Group eBooks<br />

07

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!