11.07.2015 Views

Bioinformatics for DNA Sequence Analysis.pdf - Index of

Chapter 5

Selection of Models of DNA Evolution with jMODELTEST

David Posada

Abstract

jMODELTEST is a bioinformatic tool for choosing among different models of nucleotide substitution. The program implements five different model selection strategies, including hierarchical and dynamical likelihood ratio tests (hLRT and dLRT), Akaike and Bayesian information criteria (AIC and BIC), and a performance-based decision theory method (DT). The output includes estimates of model selection uncertainty, parameter importances, and model-averaged parameter estimates, including model-averaged phylogenies. jMODELTEST is a Java program that runs under Mac OS X, Windows, and Unix systems with a Java Runtime Environment installed, and it can be freely downloaded from http://darwin.uvigo.es.

Key words: Model selection, likelihood ratio tests, AIC, BIC, performance-based selection, statistical phylogenetics.

1. Introduction

Phylogenetic reconstruction from DNA sequences is a problem of statistical inference. Since statistical inferences cannot be drawn in the absence of probabilities, the use of models of nucleotide substitution to calculate probabilities of change between nucleotides along the branches of a phylogenetic tree is essential for many evolutionary and comparative analyses. Importantly, the use of different models of DNA evolution can change the outcome of the phylogenetic analysis. The parameters affected include branch lengths, transition/transversion ratio, overall divergence, and rate variation among sites, whose estimates can be biased under a wrong model

David Posada (ed.), Bioinformatics for DNA Sequence Analysis, Methods in Molecular Biology 537. © Humana Press, a part of Springer Science+Business Media, LLC 2009. DOI 10.1007/978-1-59745-251-9_5
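
As a rough illustration of the information criteria named in the abstract, the sketch below computes AIC and BIC from the standard textbook definitions (AIC = 2k - 2 ln L, BIC = k ln n - 2 ln L) and turns the AIC scores into Akaike weights, one common way to express model selection uncertainty. It is not jMODELTEST's own code. The log-likelihoods and the alignment length are invented for the example, and the parameter counts assume that only substitution parameters are counted, not branch lengths.

```python
import math


def aic(log_likelihood, k):
    """Akaike information criterion: 2k - 2 ln L."""
    return 2 * k - 2 * log_likelihood


def bic(log_likelihood, k, n):
    """Bayesian information criterion: k ln(n) - 2 ln L, with sample size n."""
    return k * math.log(n) - 2 * log_likelihood


def criterion_weights(scores):
    """Convert criterion scores (lower is better) into weights that sum to 1."""
    best = min(scores.values())
    relative = {name: math.exp(-0.5 * (s - best)) for name, s in scores.items()}
    total = sum(relative.values())
    return {name: r / total for name, r in relative.items()}


# Hypothetical candidates: (log-likelihood, number of free substitution parameters).
# The log-likelihood values are made up for illustration only.
candidates = {
    "JC": (-2500.0, 0),
    "HKY": (-2450.0, 4),
    "GTR": (-2447.0, 8),
}
alignment_length = 1000  # assumed sample size n for BIC

aic_scores = {name: aic(lnl, k) for name, (lnl, k) in candidates.items()}
bic_scores = {name: bic(lnl, k, alignment_length) for name, (lnl, k) in candidates.items()}
aic_weights = criterion_weights(aic_scores)

best_aic = min(aic_scores, key=aic_scores.get)
best_bic = min(bic_scores, key=bic_scores.get)

for name in candidates:
    print(f"{name}: AIC={aic_scores[name]:.2f} "
          f"BIC={bic_scores[name]:.2f} weight={aic_weights[name]:.3f}")
print("Best by AIC:", best_aic, "| best by BIC:", best_bic)
```

With these made-up numbers the richest model (GTR) has the highest likelihood, but both criteria penalize its extra parameters enough that the intermediate model scores best. This is the trade-off between fit and complexity that such criteria are meant to capture.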
