12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

4 Membrane <strong>Protein</strong> <strong>Structure</strong> Prediction 99Fig. 4.3 A Kyte-Doolittle hydropathy plot. The protein sequence is scanned <strong>with</strong> a sliding windowof size 19–21 residues. At each position, the mean hydrophobic index of the amino acids<strong>with</strong>in the window is calculated and that value plotted as the midpoint of the window. This plotrepresents a TM protein <strong>with</strong> 4 TM heliceshave since been replaced by machine learning approaches which prevail overhydrophobicity methods due <strong>to</strong> their probabilistic formulation. A selection ofmachine learning-based predic<strong>to</strong>rs can be found in Table 4.3.Hidden Markov models (HMMs) were first applied <strong>to</strong> TM <strong>to</strong>pology prediction inTMHMM (Krogh et al. 2001) and HMMTOP (Tusnády and Simon 1998), and haveproved highly successful. TMHMM implements a cyclic model <strong>with</strong> seven states fora TM helix, while HMMTOP uses HMMs <strong>to</strong> distinguish between five structuralstates [helix core, inside loop, outside loop, helix caps (C and N) and globulardomains]. These states are connected by transition probabilities before dynamicprogramming is used <strong>to</strong> match a sequence against a model <strong>with</strong> the most probable<strong>to</strong>pology. HMMTOP also allows constrained predictions <strong>to</strong> be made, where specificresidues can be fixed <strong>to</strong> a <strong>to</strong>pological location based on experimental data.Neural networks (NNs) are employed by methods including PHDhtm (Rostet al. 1996) and MEMSAT3 (Jones 2007). PHDhtm uses multiple sequence alignments<strong>to</strong> perform a consensus prediction of TM helices by combining two NNs.The first creates a ‘sequence-<strong>to</strong>-structure’ network which represents the structuralpropensity of the central residue in a window. A ‘structure-<strong>to</strong>-structure’ networkthen smoothes these propensities <strong>to</strong> predict TM helices, before the positive-insiderule is applied <strong>to</strong> produce an overall <strong>to</strong>pology. MEMSAT3 uses a neural networkand dynamic programming in order <strong>to</strong> predict not only TM helices, but also <strong>to</strong> scorethe <strong>to</strong>pology and <strong>to</strong> identify possible signal peptides. Additional evolutionary informationprovided by multiple sequence alignments led <strong>to</strong> prediction accuraciesincreasing <strong>to</strong> as much as 80% using one dataset (Jones 2007) .

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!