Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
THE EXOME COVERAGE AND IDENTIFICATION<br />
(EXCID) REPORT: A GENE SURVEY TOOL FOR<br />
CLINICAL SEQUENCING APPLICATIONS<br />
Wednesday, 1st June 20:00 La Fonda Mezzanine (2nd Floor) Poster (PS‐2b.10)<br />
Rashesh Sanghvi, Kimberly Walker, Qiaoyan Wang, Harsha Doddapaneni, Yi Han,<br />
Huyen Dinh, Eric Boerwinkle, Donna Muzny, Richard Gibbs<br />
Baylor College of Medicine<br />
The Exome Coverage and Identification (ExCID) Report was developed at the Baylor College of<br />
Medicine Human Genome <strong>Sequencing</strong> Center (BCM‐HGSC) to represent gene, transcript and exon<br />
sequence coverage for samples analyzed with panel sequencing, WES and WGS. Since March 2013,<br />
the report has been used to analyze more than 22,888 WES, 9,428 regional captures and 32,660 WGS<br />
research samples for the BCM‐HGSC and more than 23,647 WES/Carrier/CAP‐CLIA samples at Baylor<br />
Miraca Genetics Laboratories (BMGL).<br />
ExCID is a one‐stop tool for providing annotations, gathering mapping metrics such as %reads mapped<br />
and %duplicates reads, coverage metrics such as average coverage and bases 20X, gene% coverages<br />
for OMIM genes and reporting exact bases that are poorly covered based on user‐defined coverage<br />
threshold. ExCID provides gene, transcript and exon annotations to the input BED regions from<br />
RefSeq, VEGA, CCDS and MIRbase. These external databases are downloaded with the installation<br />
of ExCID and can be updated using the scripts provided in the installation.<br />
The user can visualize coverage over the region of interest using the UCSC tracks (WGS and BigBed)<br />
and the OMIM Gene % coverages bar chart. Results are reported as boxplots and other ‘coverage<br />
tracks’ that can be visualized in popular browsers such as the Integrative Genome Viewer (IGV)<br />
and the UCSC Genome Browser. ExCID output was utilized by BMGL website to provide clinicians<br />
easy access to in‐depth information on clinical design aspects and coverage.<br />
In addition, ExCID’s batch ability makes it possible to compare data from hundreds of samples to<br />
reveal trends in the performance of large scale sequencing projects and asses the consistency of<br />
sequencing strategies. This can be used to determine the common problematic regions across<br />
samples leading to design improvements and unique solutions. Analyzing clinical WES samples led<br />
to technical developments such as capture reagent spike‐in (i.e. panel killer) and improving coverage<br />
of nearly all clinically targeted genes (N=3,200 at 100% coverage). Furthermore, ExCID is used for<br />
evaluating the efficiency of in‐house pipelines to meet requirements and set thresholds for various<br />
coverage metrics. ExCID’s ability to asses the percentage of gene covered is routinely used both<br />
clinical and research setting to samples with irregular performance. ExCID has also been used by<br />
the Clinical <strong>Sequencing</strong> Exploratory Research (CSER) <strong>Sequencing</strong> Standards working group, a<br />
consortium of ten clinical sequencing centers across the country, to determine the regions of the<br />
genome that remain difficult to sequence, regardless of the method used. Establishment of robust<br />
WGS pipeline metrics are essential for broad applications in both the research and clinical setting.<br />
98