27.12.2012 Views

l - People

l - People

l - People

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

We also sought to determine whether there existed a bias towards particular protein<br />

classes, in either the Hinge Atlas or the nonredundant set of MolMovDB morphs from<br />

which it was compiled. To do this, we first counted the number of times each top-level<br />

Gene Ontology (GO) term under the “molecular function” ontology was associated with a<br />

protein in the Hinge Atlas. Where the annotation was given for deeper levels, we traced<br />

up the hierarchical tree to retrieve the corresponding top level term in the ontology. Thus<br />

we found, for example, that 14 proteins in the Hinge Atlas were associated with the term<br />

“nucleic acid binding.” We repeated this procedure for the PDB as a whole as well as for<br />

the non-redundant set of 1508 morphs in MolMovDB from which the Hinge Atlas was<br />

compiled. The results for the 10 most frequently encountered GO terms are shown in<br />

Table 2.9.<br />

To compare the Hinge Atlas counts to the PDB counts in an overall fashion, we used the<br />

chi-square distribution with 162 degrees of freedom (from 163 GO terms and 2 datasets)<br />

and obtained a chi-square value of 121.1. This corresponds to a p-value of 0.9931, so<br />

there is no statistically significant difference in the distribution of these terms in the<br />

Hinge Atlas vs. the entire Protein Data Bank.<br />

Statistical comparison of datasets<br />

The Hinge Atlas and computer annotated sets were compiled differently, therefore one<br />

might suspect that the hinges from one set might comprise a statistically different<br />

population from the hinges of the other set. If this were the case, then one of the two sets<br />

78

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!