28.02.2013 Views

Bio-medical Ontologies Maintenance and Change Management

Bio-medical Ontologies Maintenance and Change Management

Bio-medical Ontologies Maintenance and Change Management

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

322 M.T. Hoque, M. Chetty, <strong>and</strong> A. Sattar<br />

operation per seconds). This is still however, many orders of magnitude lower<br />

than the requirement for a realistic solution.<br />

With the objective of successfully building an effective computational strategy<br />

to unravel the complexities of the sequence-to-folding relationship, even using the<br />

well-established HP model, an efficient <strong>and</strong> robust solution has still to be developed.<br />

In highlighting the various computational intelligence approaches for ab initio<br />

PSP, the next section focuses mainly upon the low resolution HP model.<br />

The HP Model<br />

The HP model introduced by Dill [32, 33] is based on the fact that the hydrophobic<br />

interactions dominate protein folding. The Hs form the protein core freeing up<br />

energy, while the Ps, have an affinity with the solvent <strong>and</strong> so tend to remain in the<br />

outer surface. For PSP, protein conformations of the sequence are placed as a selfavoiding<br />

walk (SAW) on a 2D or 3D lattice. The energy of a given conformation<br />

is defined as a number of topological neighbouring (TN) contacts between those<br />

Hs, which are not sequential with respect to the sequence.<br />

PSP is formally defined as: for an amino-acid sequence n s s s s s , , , , = 1 2 3 L , a<br />

*<br />

conformation c needs to be formed whereby c ∈ C(<br />

s)<br />

, energy<br />

*<br />

{ E(<br />

c)<br />

c C}<br />

E = E(<br />

C)<br />

= min | ∈ [42], where n is the total amino acids in the sequence<br />

<strong>and</strong> C (s)<br />

is the set of all valid (i.e., SAW) conformations of s. If the number<br />

of TNs in a conformation c is q then the value of E (c)<br />

is defined as E( c)<br />

= −q<br />

<strong>and</strong> the fitness function is F = − q . The optimum conformation will have maximum<br />

possible value of |F|. In a 2D HP square lattice model (Fig. 3. (a)), a nonterminal<br />

<strong>and</strong> a terminal residue, both having 4 neighbours can have a maximum of<br />

2 TNs <strong>and</strong> 3 TNs respectively. In a 2D FCC HP model (Fig. 3. (b)), a non-terminal<br />

<strong>and</strong> a terminal residue both having 6 neighbours can have a maximum of 4 TNs<br />

<strong>and</strong> 5 TNs respectively.<br />

Many of the successful PSP software such as ROSETTA [4, 43], PROTINFO<br />

[44, 45], TASSER [46] use various resolution of models embedded into the<br />

(a) (b)<br />

Fig. 3. Conformations in the 2D HP model shown by a solid line. (a) 2D square lattice having<br />

fitness = - (TN Count) = -9. (b) 2D FCC lattice having fitness = -15. ‘ ’ indicates a<br />

hydrophobic <strong>and</strong> ‘ ’ a hydrophilic residue. The dotted line indicates a TN. Starting residue<br />

is indicated by ‘1’.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!