Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
AN IMPROVED ASSEMBLY ALGORITHM FOR DE<br />
NOVO CIRCULAR GENOME RECONSTRUCTION<br />
Wednesday, 1st June 20:00 La Fonda NM Room (1st floor) Poster (PS‐1b.14)<br />
Christian Olsen, Helen Shearman, Richard Moir, Matt Kearse, Simon Buxton,<br />
Matthew Cheung, Jonas Kuhn, Steven Stones Havas, Chris Duran<br />
1 Biomatters, Inc.<br />
Circular chromosomes or genomes, such as viruses, bacteria, mitochondria and plasmids, are a<br />
common occurrence in nature, but despite the wide array of algorithms available for de novo assembly,<br />
the circularity of these DNA molecules is largely overlooked. There are some limitations of this<br />
oversight. Current NGS assembly algorithms assume a linear molecule and will result in a linearly<br />
represented genome, with a breakpoint in an arbitrary position. This is becoming increasingly<br />
problematic with increasing NGS read lengths resulting in fewer contig assemblies with less ambiguity.<br />
Additionally, long read sequencing technologies promise to provide sequences that may greatly span<br />
breakpoints at the expense of coverage resulting in a relatively large quanta of information loss.<br />
Although there are currently methods for the re‐circularisation of contigs post‐assembly by identifying<br />
common trailing/leading sequence motifs, we present a more robust approach of circularising<br />
during the assembly process whilst still allowing the merging of similar and sub‐contigs throughout<br />
the overlap‐based approach. This method can also combat the issue of chimeric sequence due to<br />
contamination, a common problem when sequencing bacterial cultures, due to decreased likelihood<br />
of conserved contig ends due to timely circularisation.<br />
We present results from both 28.6 million read Pan troglodytes illumina data set and 267,491 read<br />
Panthera leo persica (Asiatic Lion) mitochondrial NGS library produced using an Ion Torrent sequencing<br />
machine. These results are discussed and compared to some of the more popular linear<br />
assembly algorithms in common usage today.<br />
Geneious R8 is the first bioinformatics software package to offer a circular de novo assembly method.<br />
The Geneious Circular de novo assembler is developed by Biomatters and may be found at<br />
http://www.geneious.com<br />
80