Bioinformatics for DNA Sequence Analysis.pdf - Index of

218 Zaneveld et al.

Table 10.1
Compositional properties calculated by CodonExplorer. Arbitrary combinations of these values may be plotted against one another using the "Custom scatter plot" option

Name: CAI*, the codon adaptation index
Equation/reference (14, 43):
    $\mathrm{CAI} = \mathrm{CAI}_{obs} / \mathrm{CAI}_{max}$
    $\mathrm{CAI}_{obs} = \left( \prod_{k=1}^{L} \mathrm{RSCU}_{k} \right)^{1/L}$
    $\mathrm{CAI}_{max} = \left( \prod_{k=1}^{L} \mathrm{RSCU}_{k\,max} \right)^{1/L}$
    $\mathrm{RSCU}_{ij} = \dfrac{x_{ij}}{\frac{1}{n_i} \sum_{j=1}^{n_i} x_{ij}}$

Name: P3, the third position GC content
Equation/reference:
    $P_3 = \dfrac{G_3 + C_3}{A_3 + T_3 + C_3 + G_3}$

Name: P3/CAI
Equation/reference: $P_3 / \mathrm{CAI}$ (see equations for P3 and CAI, above)

Name: Length
Equation/reference: Gene length

Name: Hydrophobicity
Equation/reference: Predicted peptide hydrophobicity (44)

Name: Horizontal transfer index (HTI)
Equation/reference:
    $\Pr(\mathrm{COD}_i \mid F) = \dfrac{\Pr(F \mid \mathrm{COD}_i)\,\Pr(\mathrm{COD}_i)}{\sum_{m=1}^{6} \Pr(F \mid \mathrm{COD}_m)\,\Pr(\mathrm{COD}_m) + \Pr(F \mid \mathrm{NON})\,\Pr(\mathrm{NON})}$  (m = 1, 2, 3, 4, 5, 6)
    The probability that a gene was produced by a Markov model matching in-frame coding genes in a genome (rather than other frames of coding genes or non-coding sequence). Genes with low HTIs and low ribosomal HTIs (see below) have been proposed as transferred (30)

Name: Ribosomal HTI
Equation/reference: The posterior probability that a gene matches the model for in-frame ribosomal proteins in a genome, rather than intergenic regions separating ribosomal genes or the non-coding frames of ribosomal proteins. Calculated as the HTI above, but using ribosomal proteins or intergenic regions separating ribosomal proteins to build the coding and non-coding models (30)

Name: Putative alien
Equation/reference: A putatively alien gene identified using the scheme of Nakamura et al. (30)

Name: Amino acid frequencies
Equation/reference: The frequency of individual amino acids, or a group of amino acids within a gene. Available options include A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y, (D + E), (K + R + H), and (L + I + V + M)

*The equation shown for CAI is the traditional equation (14); the other CAI options are variants of this measure that may be useful for specific analyses.

figure is generated showing a histogram of the average values for each random set (Fig. 10.9). The blue circle (dark grey here) represents the mean value of the chosen property for these random sets. The red circle represents the average of the chosen property
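
The RSCU, CAI, and P3 definitions listed in Table 10.1 can be illustrated with a minimal Python sketch. This is not CodonExplorer's implementation: the function names (`rscu`, `cai`, `p3`), the use of the standard genetic code, the pseudocount of 0.5 for codons absent from the reference set, and the exclusion of stop codons and single-codon amino acids (Met, Trp) are all assumptions made for this example.

```python
from collections import Counter
from math import exp, log

# Standard genetic code, built from the conventional TCAG ordering (assumption:
# the genome under study uses the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {a + b + c: AMINO[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)}


def codons(seq):
    """Split an in-frame sequence into complete triplets."""
    seq = seq.upper()
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def rscu(reference_genes, pseudocount=0.5):
    """RSCU_ij = x_ij / ((1/n_i) * sum_j x_ij), counted over a reference set.

    Codons never seen in the reference get `pseudocount` instead of zero
    (an assumption of this sketch, so that logarithms stay defined).
    """
    counts = Counter(c for g in reference_genes for c in codons(g) if c in CODE)
    families = {}
    for codon, aa in CODE.items():
        families.setdefault(aa, []).append(codon)
    table = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue  # skip stop codons and single-codon amino acids
        x = [counts[c] or pseudocount for c in family]
        mean = sum(x) / len(family)
        for codon, x_ij in zip(family, x):
            table[codon] = x_ij / mean
    return table


def cai(gene, rscu_table):
    """CAI = CAI_obs / CAI_max, each a geometric mean over the gene's codons."""
    max_by_aa = {}
    for codon, value in rscu_table.items():
        aa = CODE[codon]
        max_by_aa[aa] = max(max_by_aa.get(aa, 0.0), value)
    used = [c for c in codons(gene) if c in rscu_table]
    if not used:
        raise ValueError("gene contains no scorable codons")
    log_obs = sum(log(rscu_table[c]) for c in used) / len(used)
    log_max = sum(log(max_by_aa[CODE[c]]) for c in used) / len(used)
    return exp(log_obs - log_max)


def p3(gene):
    """P3 = (G3 + C3) / (A3 + T3 + C3 + G3), over third codon positions."""
    thirds = [c[2] for c in codons(gene) if c[2] in "ACGT"]
    if not thirds:
        raise ValueError("gene contains no unambiguous third positions")
    return sum(b in "GC" for b in thirds) / len(thirds)
```

In practice `rscu` would be called once on a set of reference genes (for example, highly expressed genes), and the resulting table passed to `cai` for each gene of interest; working in log space avoids underflow when multiplying many RSCU values for long genes.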
