Sequencing

11th Annual Sequencing, Finishing, and Analysis in the Future Meeting

QUALITY ASSESSMENT AND VALIDATION CRITERIA – TOWARDS THE DEFINITION OF TABLE 1

Wednesday, 1st June 20:00
La Fonda NM Room (1st floor)
Poster (PS-1b.09)

Dominika Borek, Maciej Puzio, Zbyszek Otwinowski
UT Southwestern Medical Center

Although next-generation sequencing (NGS) provides the means to study properties of nucleic acids on unprecedented scales, concise measures for assessing the confidence of results from NGS experiments are lacking. Tens to hundreds of statistical indicators are currently available, which separately provide information about: (1) the quality of the sequencing library and the material that was used to generate it, (2) the performance of the equipment, (3) potential biases in the results, and other important sequencing-related features. However, the average consumer of NGS technology is rarely in a position to efficiently integrate all of this information. As a result, decisions about whether an experiment was successful and whether the results are trustworthy are frequently arbitrary. Comparative and meta-analyses are the areas most affected: differences are attributed to biological phenomena when they frequently originate from differences in the experimental approach. The lack of transparent validation criteria leads not only to incorrect or sub-optimal interpretation of results but also to expensive over-sequencing.

We have developed alignment-free metrics that provide transparent and comprehensive validation of NGS experiment results and define the so-called Table 1, which concisely summarizes the quality of an experiment and data analysis so that NGS users and reviewers of publications and grant applications can quickly, and yet with high certainty, assess the quality of a particular NGS experiment. Our approach is based on data mining of sequencing reads, which includes analysis of overdispersion properties. This is followed by the analysis of residuals to detect whether our models of the experiment are sufficiently complete. This approach partitions uncertainty into components related to error sources and estimates the magnitude of each error source. Together, these directly assess the quality of NGS experiments and contribute to the validation of NGS results.
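
As a rough illustration of what an alignment-free overdispersion check might look like, the sketch below counts k-mers directly from reads and computes a variance-to-mean ratio of those counts. This is not the authors' metric, which the abstract does not specify; the function names, the choice of k-mer counts as the statistic, and the use of a simple dispersion index are all assumptions made for illustration.

```python
from collections import Counter
from statistics import mean, pvariance


def kmer_counts(reads, k):
    """Count every substring of length k across all reads (no alignment needed)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def dispersion_index(counts):
    """Variance-to-mean ratio of the observed k-mer counts.

    Hypothetical indicator: larger values mean the counts are more spread
    out than their mean alone would suggest. Only k-mers seen at least once
    are included, so this is a crude signal, not a calibrated test.
    """
    values = list(counts.values())
    return pvariance(values) / mean(values)


if __name__ == "__main__":
    reads = ["ACGTACGTAC", "ACGTACGTTT", "TTTTTTTTTT"]
    print(dispersion_index(kmer_counts(reads, k=4)))
```

A real analysis would compare such a statistic against an explicit model of the experiment and then inspect the residuals, as the abstract describes; the sketch only shows the counting step and one possible summary.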
NEXTSEQ V2 VS HISEQ RAPID RUN NGS DATA QUALITY COMPARISON IN THE CONTEXT OF DOWNSTREAM ANALYSIS FOR PUBLIC HEALTH SURVEILLANCE OF FOODBORNE BACTERIAL PATHOGENS

Wednesday, 1st June 20:00
La Fonda NM Room (1st floor)
Poster (PS-1b.10)

Andrew Huang¹, Rebecca Lindsey¹, Blake Dinsmore¹, Jeremy Peirce², Charlotte Steininger¹, Peyton Smith¹, Lisley Garcia Toledo¹, Vikrant Dutta¹, Janet Pruckler¹, Kelly Hoon², Collette Fitzgerald¹, Heather Carleton¹
¹Centers for Disease Control and Prevention, ²Illumina Inc

Introduction
Over the last four years, the decreasing cost and ease of use of benchtop sequencers have driven adoption of next-generation sequencing by local, state, regional, and federal public health laboratories for the identification and surveillance of foodborne pathogens. At the same time, a variety of sequencing devices and sequencing chemistries are on the market. We had previously observed differences in data quality when the same samples were sequenced using MiSeq v2 compared to NextSeq v1 chemistries, resulting in significant differences in SNP and allele calling downstream. Because these differences could impact downstream bioinformatic analyses and outbreak cluster detection, we need to ensure that the same samples sequenced on the different sequencers used by public health laboratories perform equivalently in downstream analyses. To this end, we prepared a standard set of sequencing libraries, ran them on HiSeq Rapid Run and NextSeq v2 sequencing chemistries, and compared the basic data quality, as well as wgMLST and hqSNP-based phylogenetic analyses, between these sequencers.

Methods
Sequencing libraries were generated from genomic DNA extracted from 32 different strains each of Campylobacter, Shiga toxin-producing Escherichia coli (STEC), and Salmonella, using Covaris shearing and the NEBNext Ultra chemistry. These libraries were then aliquoted and run on HiSeq Rapid Run 2x250 and NextSeq v2 2x150 sequencing chemistry. To ensure uniform sequencing effort, the read sets corresponding to each strain were trimmed to the same read length and down-sampled to a common coverage level prior to comparison. Basic quality analysis was conducted using R and QUAST (http://bioinf.spbau.ru/quast). We then built hqSNP-based phylogenies from the combined sets of data for each species using the Lyve-SET pipeline (https://github.com/lskatz/lyve-SET), as well as wgMLST-based phylogenies for each species using Applied Maths' Bionumerics version 7.5.

Results
Both sequencing chemistries had similar quality profiles, with an advantage for HiSeq Rapid Run when constructing longer contigs. Despite these differences, the wgMLST analysis from Bionumerics showed no differences in allele calling between HiSeq Rapid Run and NextSeq v2 sequenced samples across any of the 32 samples in each of the three species. The hqSNP analysis from Lyve-SET likewise showed no SNP differences between HiSeq Rapid Run and NextSeq v2 sequenced samples across any of the 32 samples each of STEC and Salmonella. For Campylobacter, one sample showed 5 SNP differences between HiSeq Rapid Run and NextSeq v2 sequencing, while the other 31 samples showed none.

Conclusion
Despite small differences in basic data quality scores, similar outcomes were obtained when comparing sequencing from HiSeq Rapid Run and NextSeq v2 using phylogenetic analyses with hqSNPs or wgMLST.
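
The normalization step described in the Methods (trimming read sets to a common read length and down-sampling them to a common coverage level) can be sketched as follows. This is a minimal illustration only: the abstract does not say which tool, target length, or coverage level was used, so the function names, the `genome_size` and `target_coverage` parameters, and the random-subsampling strategy are assumptions. The sketch also treats reads as plain strings and ignores read pairing and quality scores.

```python
import random


def trim_reads(reads, length):
    """Truncate every read to at most `length` bases."""
    return [read[:length] for read in reads]


def downsample_to_coverage(reads, genome_size, target_coverage, seed=0):
    """Randomly keep reads until their total bases reach the target coverage.

    Coverage is approximated as total_bases / genome_size. If the input has
    fewer bases than the target, all reads are returned.
    """
    rng = random.Random(seed)  # fixed seed so the subsample is reproducible
    shuffled = list(reads)
    rng.shuffle(shuffled)
    target_bases = target_coverage * genome_size
    kept, total = [], 0
    for read in shuffled:
        if total >= target_bases:
            break
        kept.append(read)
        total += len(read)
    return kept


if __name__ == "__main__":
    # Hypothetical input: 2x250-style reads cut down to 150 bases, then
    # subsampled to 3x coverage of a toy 1,000-base genome.
    reads_250 = ["A" * 250 for _ in range(100)]
    reads_150 = trim_reads(reads_250, 150)
    subset = downsample_to_coverage(reads_150, genome_size=1000, target_coverage=3)
    print(len(subset), sum(len(r) for r in subset) / 1000)
```

Applying the same trimmed length and the same target coverage to both platforms' read sets is what makes the later quality and phylogenetic comparisons reflect the chemistry rather than differences in sequencing depth or read length.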