13.07.2015 Views

BigFoot: Bayesian Alignment and Phylogenetic Footprinting with ...



Availability and Requirements

• Project name: BigFoot
• Project webpage: http://www.stats.ox.ac.uk/~satija/BigFoot/
• Operating system: Platform independent
• Programming language: Java
• Other requirements: Java Virtual Machine 1.5 or higher
• License: GNU GPL

Figure Legends

Figure 1: Dynamic programming (SAPF) and MCMC (BigFoot) predictions and annotated binding sites for the eve stripe 2 enhancer. For each nucleotide in the D. melanogaster sequence, both programs output the probability that the nucleotide was generated by a functional (slow) state. Experimentally verified binding sites in D. melanogaster for the transcription factors Bicoid (BC), Hunchback (HB), Kruppel (KR), Giant (GT), and Sloppy-paired 1 (Sl1) are shown above the posterior probabilities.

Figure 2: BigFoot results for the eve stripe 2 enhancer when analyzing four sequences and ten sequences. Increasing the number of species in the analysis results in higher posterior probabilities in many experimentally verified binding sites, and increases the nucleotide resolution when identifying the precise locations of the TFBS.

Figure 3: Two independent BigFoot runs on the αMRE enhancer in 12 vertebrate species. Despite having very different starting points, the two runs give essentially identical results, indicating convergence of the sampling distribution. The locations of seven previously identified binding sites are displayed above the posterior probabilities. The only binding site not detected with greater than 95% probability, bs2, is directly adjacent to a weakly conserved region (bsAlt) that is undetected by other methods due to alignment errors.

Figure 4: BigFoot screenshot showing a part of the estimated Maximum Posterior Decoding alignment during an MCMC run. This screenshot is taken from an analysis of the αMRE enhancer region using 12 vertebrate species. The blue curve represents the posterior probability of each alignment column, and the red curve represents the phylogenetic footprinting predictions.
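The comparison of two runs and the 95% cutoff mentioned in the Figure 3 legend can be illustrated with a small post-processing sketch. This is not part of BigFoot and does not use its API. The function names, the use of the maximum absolute difference as an agreement measure, and the example values are all assumptions made for illustration. The input is assumed to be one posterior probability per nucleotide, as described for Figure 1.

```python
from typing import List, Tuple


def max_abs_difference(run_a: List[float], run_b: List[float]) -> float:
    """Largest per-position gap between two runs' posterior probabilities.

    A small value suggests the two runs agree; this is an illustrative
    measure only, not the convergence criterion used by the authors.
    """
    if len(run_a) != len(run_b):
        raise ValueError("runs must cover the same positions")
    return max((abs(a - b) for a, b in zip(run_a, run_b)), default=0.0)


def regions_above(posterior: List[float],
                  threshold: float = 0.95) -> List[Tuple[int, int]]:
    """Return half-open (start, end) index ranges where posterior > threshold."""
    regions = []
    start = None
    for i, p in enumerate(posterior):
        if p > threshold and start is None:
            start = i                      # a new high-probability run begins
        elif p <= threshold and start is not None:
            regions.append((start, i))     # the run ended at the previous index
            start = None
    if start is not None:                  # run extends to the end of the sequence
        regions.append((start, len(posterior)))
    return regions


# Hypothetical values for illustration only; not output from BigFoot.
run_1 = [0.10, 0.97, 0.99, 0.98, 0.40, 0.96, 0.97]
run_2 = [0.12, 0.96, 0.99, 0.97, 0.38, 0.97, 0.98]

print(max_abs_difference(run_1, run_2))
print(regions_above(run_1))
```

Half-open ranges are used so that the length of each region is simply `end - start`, and adjacent regions never share an index.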
