01.06.2016 Views

Sequencing

SFAF2016%20Meeting%20Guide%20Final%203

SFAF2016%20Meeting%20Guide%20Final%203

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />

THE EXOME COVERAGE AND IDENTIFICATION<br />

(EXCID) REPORT: A GENE SURVEY TOOL FOR<br />

CLINICAL SEQUENCING APPLICATIONS<br />

Wednesday, 1st June 20:00 La Fonda Mezzanine (2nd Floor) Poster (PS‐2b.10)<br />

Rashesh Sanghvi, Kimberly Walker, Qiaoyan Wang, Harsha Doddapaneni, Yi Han,<br />

Huyen Dinh, Eric Boerwinkle, Donna Muzny, Richard Gibbs<br />

Baylor College of Medicine<br />

The Exome Coverage and Identification (ExCID) Report was developed at the Baylor College of<br />

Medicine Human Genome <strong>Sequencing</strong> Center (BCM‐HGSC) to represent gene, transcript and exon<br />

sequence coverage for samples analyzed with panel sequencing, WES and WGS. Since March 2013,<br />

the report has been used to analyze more than 22,888 WES, 9,428 regional captures and 32,660 WGS<br />

research samples for the BCM‐HGSC and more than 23,647 WES/Carrier/CAP‐CLIA samples at Baylor<br />

Miraca Genetics Laboratories (BMGL).<br />

ExCID is a one‐stop tool for providing annotations, gathering mapping metrics such as %reads mapped<br />

and %duplicates reads, coverage metrics such as average coverage and bases 20X, gene% coverages<br />

for OMIM genes and reporting exact bases that are poorly covered based on user‐defined coverage<br />

threshold. ExCID provides gene, transcript and exon annotations to the input BED regions from<br />

RefSeq, VEGA, CCDS and MIRbase. These external databases are downloaded with the installation<br />

of ExCID and can be updated using the scripts provided in the installation.<br />

The user can visualize coverage over the region of interest using the UCSC tracks (WGS and BigBed)<br />

and the OMIM Gene % coverages bar chart. Results are reported as boxplots and other ‘coverage<br />

tracks’ that can be visualized in popular browsers such as the Integrative Genome Viewer (IGV)<br />

and the UCSC Genome Browser. ExCID output was utilized by BMGL website to provide clinicians<br />

easy access to in‐depth information on clinical design aspects and coverage.<br />

In addition, ExCID’s batch ability makes it possible to compare data from hundreds of samples to<br />

reveal trends in the performance of large scale sequencing projects and asses the consistency of<br />

sequencing strategies. This can be used to determine the common problematic regions across<br />

samples leading to design improvements and unique solutions. Analyzing clinical WES samples led<br />

to technical developments such as capture reagent spike‐in (i.e. panel killer) and improving coverage<br />

of nearly all clinically targeted genes (N=3,200 at 100% coverage). Furthermore, ExCID is used for<br />

evaluating the efficiency of in‐house pipelines to meet requirements and set thresholds for various<br />

coverage metrics. ExCID’s ability to asses the percentage of gene covered is routinely used both<br />

clinical and research setting to samples with irregular performance. ExCID has also been used by<br />

the Clinical <strong>Sequencing</strong> Exploratory Research (CSER) <strong>Sequencing</strong> Standards working group, a<br />

consortium of ten clinical sequencing centers across the country, to determine the regions of the<br />

genome that remain difficult to sequence, regardless of the method used. Establishment of robust<br />

WGS pipeline metrics are essential for broad applications in both the research and clinical setting.<br />

98

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!