12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

264 R.A. LaskowskiFig. 10.6 A ligand-binding template match from ProFunc. (a) The three purple residues correspond<strong>to</strong> a ligand-binding template for a coenzyme A (coa) ligand, shown coloured by a<strong>to</strong>m type(carbon grey, nitrogen blue, oxygen red, sulphur yellow and phosphorus orange). The templateresidues are Asn138, Ser141 and Cys134 from PDB entry 1s7n, a RimL N(α)-acetyltransferasefrom Salmonella typhimurium. The three yellow residues correspond <strong>to</strong> the residues in the querystructure, PDB entry 2fck, that match the template residues. They are: Asn140, Ser143 andCys136, respectively. The RMSD of the 14 matched side chain a<strong>to</strong>ms is 1.18 Å. (b) As in a, but<strong>with</strong> additional matching residues lying <strong>with</strong>in 10 Å of the template’s centre shown. These areresidues of identical residue type which overlap when the query and template structures are superposedon the basis of the template match. The purple residues are from the template structure(1s7n) while the yellow ones are from the query structure (2fck)Reverse TemplatesThe fourth group of templates are intended <strong>to</strong> find any matches that the first three setsmay have missed. They are the reverse templates which are computed from the targetstructure itself. They are generated using much the same rules as for the ligand- andDNA-binding templates. The main differences are that, firstly, the whole protein structureis considered, rather than merely the residues in contact <strong>with</strong> ligand or DNA, andsecondly that the weighting of each of the templates is by residue conservation (asobtained from a multiple alignment of the sequences returned by a BLAST searchagainst the UniProt sequence database). The templates are selected so that, ideally,each residue in the protein is present in at least one template, although, if <strong>to</strong>o many aregenerated, their number is capped at twice the number of residues in the sequence.The <strong>to</strong>p reverse template hit <strong>to</strong> 2fck is shown in Fig. 10.7. The match is <strong>to</strong> PDBentry 1s7f, a RimL N(α)-acetyltransferase from S. typhimurium. This is the apoform of the 1s7n structure matched by the SSM fold-matching program and theligand-binding templates above.Template Searching and ScoringThe template searches can return hundreds, thousands or even tens of thousandsof matches, particularly in the case of the reverse templates. The problem, then, is<strong>to</strong> discard fortui<strong>to</strong>us matches and retain only significant matches, ranked in order

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!