12.07.2015 Views

View - ResearchGate

View - ResearchGate

View - ResearchGate

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

6Sybil: Methods and Software for Multiple GenomeComparison and VisualizationJonathan Crabtree, Samuel V. Angiuoli, Jennifer R. Wortman,and Owen R. WhiteSummaryWith the successful completion of genome sequencing projects for a variety of model organisms,the selection of candidate organisms for future sequencing efforts has been guided increasingly bya desire to enable comparative genomics. This trend has both depended on and encouraged thedevelopment of software tools that can elucidate and capitalize on the similarities and differencesbetween genomes. “Sybil,” one such tool, is a primarily web-based software package whose primarygoal is to facilitate the analysis and visualization of comparative genome data, with a particularemphasis on protein and gene cluster data. Herein, a two-phase protein clustering algorithm,used to generate protein clusters suitable for analysis through Sybil and a method for creating graphicaldisplays of protein or gene clusters that span multiple genomes are described. When combined,these two relatively simple techniques provide the user of the Sybil software (The Institute forGenomic Research [TIGR] Bioinformatics Department) with a browsable graphical display of hisor her “input” genomes, showing which genes are conserved based on the parameters supplied tothe protein clustering algorithm. For any given protein cluster the graphical display consists of alocal alignment of the genomes in which the clustered genes are located. The genomes are arrangedin a vertical stack, as in a multiple alignment, and shaded areas are used to connect genes in thesame cluster, thus displaying conservation at the protein level in the context of the underlyinggenomic sequences. The authors have found this display—and slight variants thereof—useful for avariety of annotation and comparison tasks, ranging from identifying “missed” gene models or single-exondiscrepancies between orthologous genes, to finding large or small regions of conservedgene synteny, and investigating the properties of the breakpoints between such regions.Key Words: Bioinformatics; Bioperl; comparative genomics; ortholog; paralog; proteinclustering; visualization.1. IntroductionThere are many ways to compare genomes and Sybil focuses on one of thesimplest methods for evaluating potential functional differences betweenFrom: Methods in Molecular Biology, vol. 408: Gene Function AnalysisEdited by: M. Ochs © Humana Press Inc., Totowa, NJ93

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!