01.06.2016 Views

Sequencing

SFAF2016%20Meeting%20Guide%20Final%203

SFAF2016%20Meeting%20Guide%20Final%203

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />

AN IMPROVED ASSEMBLY ALGORITHM FOR DE<br />

NOVO CIRCULAR GENOME RECONSTRUCTION<br />

Wednesday, 1st June 20:00 La Fonda NM Room (1st floor) Poster (PS‐1b.14)<br />

Christian Olsen, Helen Shearman, Richard Moir, Matt Kearse, Simon Buxton,<br />

Matthew Cheung, Jonas Kuhn, Steven Stones Havas, Chris Duran<br />

1 Biomatters, Inc.<br />

Circular chromosomes or genomes, such as viruses, bacteria, mitochondria and plasmids, are a<br />

common occurrence in nature, but despite the wide array of algorithms available for de novo assembly,<br />

the circularity of these DNA molecules is largely overlooked. There are some limitations of this<br />

oversight. Current NGS assembly algorithms assume a linear molecule and will result in a linearly<br />

represented genome, with a breakpoint in an arbitrary position. This is becoming increasingly<br />

problematic with increasing NGS read lengths resulting in fewer contig assemblies with less ambiguity.<br />

Additionally, long read sequencing technologies promise to provide sequences that may greatly span<br />

breakpoints at the expense of coverage resulting in a relatively large quanta of information loss.<br />

Although there are currently methods for the re‐circularisation of contigs post‐assembly by identifying<br />

common trailing/leading sequence motifs, we present a more robust approach of circularising<br />

during the assembly process whilst still allowing the merging of similar and sub‐contigs throughout<br />

the overlap‐based approach. This method can also combat the issue of chimeric sequence due to<br />

contamination, a common problem when sequencing bacterial cultures, due to decreased likelihood<br />

of conserved contig ends due to timely circularisation.<br />

We present results from both 28.6 million read Pan troglodytes illumina data set and 267,491 read<br />

Panthera leo persica (Asiatic Lion) mitochondrial NGS library produced using an Ion Torrent sequencing<br />

machine. These results are discussed and compared to some of the more popular linear<br />

assembly algorithms in common usage today.<br />

Geneious R8 is the first bioinformatics software package to offer a circular de novo assembly method.<br />

The Geneious Circular de novo assembler is developed by Biomatters and may be found at<br />

http://www.geneious.com<br />

80

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!