You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Computational tools and resources for comparative<br />
tree genomics<br />
S4.2<br />
JILL WEGRZYN 1,2 , EMILY GRAU 3 , HANS VASQUEZ- 15:30–16.00<br />
GROSS 3 , TAEIN LEE 4, JODI HUMANN 4 , STEPHEN FICKLIN 4 ,<br />
MARGARET STATON 5 , NATHAN HENRY 5 , DOREEN MAIN 4<br />
and DAVID NEALE 3<br />
jill.wegrzyn@uconn.edu<br />
1 Department of Ecology and Evolutionary Biology, University of<br />
Connecticut, Storrs, CT, USA; 2 Institute for Systems Genomics, University of<br />
Connecticut, Storrs, CT, USA; 3 Department of Plant Sciences, University of<br />
California at Davis, Davis, CA, USA; 4 Department of Horticulture,<br />
Washington State University, Pullman, WA, USA; 5 Department of<br />
Entomology and Plant Pathology, University of Tennessee, TN, USA<br />
Full genome sequences are being generated for the first time for several conifer and hardwood<br />
species. The recent influx of genomic resources from previously underserved tree species presents a<br />
wealth of information and scientific opportunities to the research community. Recipients of this data<br />
are often faced with navigating the complex landscape of online resources with conflicting genome<br />
versions, incomplete annotations, and various levels of resolution. As is often the case with early<br />
genome sequences and genomic sets that pre-date a full genome, the optimal data source and<br />
associated metadata are difficult to obtain. This problem is magnified by the size of genomic<br />
datasets which produce additional barriers once the user brings this information to a desktop<br />
machine for analysis. In addition to genomic data, we are increasingly interested in associating<br />
phenotypic and environmental metrics to geo-referenced trees. These datasets provide additional<br />
computational challenges and reside in repositories even more diverse than those holding genomic<br />
artefacts.<br />
Here, we will describe two clade organism databases – TreeGenes<br />
(http://dendrome.ucdavis.edu/treegenes) and the Hardwood Genomics Web<br />
(http://www.hardwoodgenomics.org/). Aside from direct access to curated genomic resources, we<br />
present two tools – GenSAS and CartograTree. GenSAS is a genome visualization tool that supports<br />
moderate to large genome assemblies and guides researchers through the process of repeat<br />
identification, gene prediction, structural annotation, and comparative genomics. CartograTree is an<br />
interactive workspace that allows for geographical visualization and engagement of high<br />
performance computing (HPC) resources. CartograTree merges genomic, phenotypic, and<br />
environmental data to facilitate association studies in a variety of forest trees.<br />
26