Scientific SessionAbstracts(S5) High-Throughput Genome Centers(S5-1) Overview <strong>of</strong> the Illumina Sequencing Platformat the Broad InstituteK. ConnollyProcess & Technology Development, Genome SequencingPlatform, The Broad Institute, Cambridge, MA, United StatesThe constant increase in quality and quantity <strong>of</strong> Next-Generationsequencing data necessitates a parallel growth in sample preparationand a scalable tracking system. The Broad Institute’s Illumina SequencingPlatform handles a variety <strong>of</strong> applications and comprises Illumina’s latesthardware, s<strong>of</strong>tware and kit releases, and a high-throughput samplepreparation process. With our automated sample preparation and QCprocesses, we have been able to meet our increased capacity goals <strong>of</strong>up to 3,840 libraries per week, and have reduced our rework rate to5% through attaining target cluster densities with high reproducibility.We continue to work closely with Illumina to develop the sequencingtechnology, using data to drive process improvements and exploringmethods to improve GC bias. Maximizing platform-wide efficiency ispossible through the implementation and continuous development <strong>of</strong>tools for process quality control and centralized communication, suchas our real-time run monitoring dashboard and JIRA tracking system.We were able to convert rapidly from GAIIxs to HiSeq2000s byestablishing and using an enterprise Knowledge Management system,with which we can efficiently accumulate and disseminate a changingknowledge base. All <strong>of</strong> these improvements are applicable to both theGA and HiSeq platforms.(S5-2) Science and Technology at a HighThroughput Genome CenterL. Fulton, R. Wilson, The Genome Center ProductionGroupThe Genome Center at Washington University School <strong>of</strong>Medicine, St. Louis, MO, United StatesThe Genome Center (GC) at Washington University School <strong>of</strong> Medicinehas developed a state <strong>of</strong> the art genomics facility. Our scientists workon a variety <strong>of</strong> cutting edge projects with researchers from aroundthe world. These collaborative research projects lead to cutting edgeadvances in the field <strong>of</strong> genomics. The structural organization at theGC reflects these efforts and is centered around six major scientificareas: Transcriptome Sequencing, Genome Assembly, Whole GenomeSequencing, Human Microbiome, Human Genetics, and TargetedResequencing. These specific scientific areas are supported by onecentral data production pipeline. Attributes <strong>of</strong> this pipeline includedetailed sample screening protocols, sample barcoding capabilitiesthat allow for a broad range <strong>of</strong> sample cohorts, multiplatform dataproduction, and the ability to select from more than one method <strong>of</strong>sequencing strategies. All <strong>of</strong> this is supported by one centralized LIMSgroup dedicated to maintaining and developing the data productioncapabilities. The technology development group investigates newtechniques and instrumentation prior to any changes in the main dataproduction pipeline. Only robust protocols and instrumentation areallowed into the data production pipeline. This strategy allows TheGenome Center to run a base data production pipeline while constantlyinfusing high quality advances. Sequence data for each project is sentinto an advanced analysis pipeline built to conduct a multitude <strong>of</strong>assessments. When needed, validation (a second sequence event) canbe used to confirm variants detected by the analysis s<strong>of</strong>tware.(S5-3) High-Throughput Next GenerationSequencing Methods and ApplicationsD. Muzny 1 , M. Wang 1 , I. Newsham 1 , Y.Q. Wu 1 , H. Dinh 1 ,C. Kovar 1 , J. Santibanez 1 , A. Sabo 1 , J. Reid 1 , M. Bainbridge 1 ,E. Boerwinkle 2 , T. Albert 3 , R. Gibbs 11Baylor College <strong>of</strong> Medicine, Human Genome SequencingCenter, Houston, TX, United States; 2 University <strong>of</strong> TexasHealth Science Center at Houston, School <strong>of</strong> Public Health,Houston, TX, United States; 3 Roche NimbleGen, Inc.,Madison, WI, United StatesSecond Generation high-throughput sequencing technologies haverevolutionized the genome sequencing applications and will ultimatelyhave great impact on personalized medicine. The increase in capacity<strong>of</strong> both the AB/Life Technologies SOLiD 4.0 and Illumina HiSeqinstrumentation and the ability <strong>of</strong> the platforms to multiplex sampleshas led to process innovations impacting many ongoing projectsat the HGSC. Applications have ranged from regional and wholeexome capture sequencing to the use <strong>of</strong> whole genome shotgun fordeep coverage and determining structural rearrangements. Internaladvancements have complemented the higher capacity instrumentationthrough the implementation <strong>of</strong> library automation, low DNA inputsamples, capture hybridization multiplexing and application <strong>of</strong> readmapping tools such as BFAST and BWA. Development <strong>of</strong> sample intakeprocedures, LIMS tracking and defined reporting metrics has enabledNexGen sequencing pipelines that can effectively deliver targetedand whole genome shotgun data for thousands <strong>of</strong> samples. Thesetechnical advancements to the pipeline have allowed us to achievea rate <strong>of</strong> ~1500 libraries/captures per month. To date the center hascompleted over 5000 exome and regional capture libraries for TheCancer Genome Atlas (TCGA), NIMH Autism, Cohorts for Heart andAging Research in Genomic Epidemiology (CHARGE-S) and 1000Genomes Project. Development <strong>of</strong> these applications and methods willbe discussed along with key data metrics, process management andpipeline organization.(S6) Strategies for Deep Mining <strong>of</strong> ComplexProtein Mixtures(S6-1) Coverage and Recovery <strong>of</strong> Upstream ProteinFractionation Methods in LC-MS/MS WorkflowsL.J. FosterCentre for High-Throughput Biology, The University <strong>of</strong> BritishColumbia, Vancouver, BC, CanadaThe proteome <strong>of</strong> any cell or even any subcellular fraction remainstoo complex for complete analysis by one dimension <strong>of</strong> liquidchromatography-tandem mass spectrometry (LC-MS/MS). Hence, toachieve greater depth <strong>of</strong> coverage for a proteome <strong>of</strong> interest, mostgroups routinely subfractionate the sample prior to LC-MS/MS sothat the material entering LC-MS/MS is less complex than the originalsample. Protein and/or peptide fractionation methods that biochemistshave used for decades, such as strong cation exchange chromatography(SCX), isoelectric focusing (IEF) and SDS-PAGE, are the most commonprefractionation methods used currently. There has, as yet, been nocomprehensive, controlled evaluation <strong>of</strong> the relative merits <strong>of</strong> thevarious methods, although some binary comparisons have been made.We will discuss the most popular methods for fractionating samples atboth the protein and peptide level, demonstrating quantitatively which40 • <strong>ABRF</strong> <strong>2011</strong> — Technologies to Enable Personalized Medicine
are the best methods for optimal recovery and proteome coverage.A novel approach for fractionating samples at the level <strong>of</strong> proteincomplexes will also be discussed.(S6-2) Improving the Comprehensiveness <strong>of</strong> Large-Scale Proteomics Experiments Using AdvancedComputational Tools and Accurate MultipleHypothesis Testing StatisticsM.J. MacCoss, J. Egertson, B. Frewen, L. Käll, W. NobleDepartment <strong>of</strong> Genome Sciences, University <strong>of</strong> Washington,Seattle, WA, United StatesMass spectrometry based technology for the analysis <strong>of</strong> complexprotein mixtures has improved at an amazing rate. With each newinstrument release, mass spectrometers have become more sensitiveand have faster MS/MS data acquisition speeds. Furthermore,instruments are continuously improving the dynamic range, massaccuracy, and resolution <strong>of</strong> the resulting mass spectrometry data. All<strong>of</strong> these developments have increased the number <strong>of</strong> peptides thatcan be identified and quantified without extending the overall analysistime. While the technological hardware advances that are required toincrease the number <strong>of</strong> peptide identifications by 50% with a constantanalysis time is monumental, we have been able to demonstrate thatincrease in performance without increasing the analysis time at all. Toaccomplish this, we have made use <strong>of</strong> improved database searchingalgorithms, spectrum library searching, use <strong>of</strong> chromatographicretention time, powerful machine learning tools, accurate multiplehypothesis testing statistics, and many more. Strategies will bediscussed on how to increase the comprehensiveness <strong>of</strong> any datasetusing improved data analysis strategies.(S6-3) In-Depth Analysis <strong>of</strong> Human and MousePlasma Using 3-D And 4-D Fractionation StrategiesD.W. SpeicherThe Wistar Institute, Philadelphia, PA, United StatesIn-depth pr<strong>of</strong>iling <strong>of</strong> plasma proteomes can potentially identify noveldisease biomarkers. But few biomarkers identified by proteomicapproaches have advanced to early-stage clinical testing because they<strong>of</strong>ten are not sufficiently disease specific. Major challenges in plasmaproteome analysis include the very wide dynamic range <strong>of</strong> proteinconcentrations, the high protein complexity, and the substantialheterogeneity <strong>of</strong> most protein concentrations in the normal humanpopulation. Because most disease-specific biomarkers are present inblood at very low concentrations, extensive fractionation is requiredprior to LC-MS/MS analysis. In general, more fractionation will result ingreater depth <strong>of</strong> analysis, but there is a point <strong>of</strong> diminishing return foreach fractionation method and throughput decreases as the number<strong>of</strong> LC-MS/MS runs per proteome increases. A common feature <strong>of</strong> mostcurrent plasma pr<strong>of</strong>iling methods is to first immunodeplete as manyhigh abundance plasma proteins as possible, followed by extensiveprotein fractionation <strong>of</strong> the depleted plasma prior to trypsin digestionand LC-MS/MS. In addition, reliable quantitative comparisons areneeded for most types <strong>of</strong> studies. While all quantitative methods havestrengths and weaknesses, label-free quantitative comparison <strong>of</strong> LC-MSsignals is increasing in popularity and seems adequately reproduciblefor most studies. Our laboratory commonly uses two alternative plasmaproteome analysis strategies. One powerful approach utilizes a 3-Dprotein/peptide pr<strong>of</strong>iling method consisting <strong>of</strong> depleting 20 abundantproteins followed by 1-D SDS PAGE, fractionation <strong>of</strong> the gel lane into20 to 60 fractions and LC-MS/MS analysis. Proteins can be quantitativelycompared using label-free analysis <strong>of</strong> ion current patterns from the MSfull scans. An even greater depth <strong>of</strong> analysis can be achieved using a4-D protein/peptide pr<strong>of</strong>iling strategy utilizing microscale solutionisoelectr<strong>of</strong>ocusing <strong>of</strong> proteins prior the SDS gel in the 3-D scheme,although throughput is substantially reduced.(S7) Next Generation Sequencing Technologieson the Horizon(S7-1) Towards Optical DNA Sequencing UsingNanopore ArraysA. MellerDepartment <strong>of</strong> Biomedical Engineering, Boston University,Boston, MA, United StatesNext generation DNA sequencing methods that utilize nanometersizepores have been subject <strong>of</strong> numerous studies in past years. One<strong>of</strong> the compelling features <strong>of</strong> the nanopore technique lies in its abilityto electrophoretically focus long DNA strands towards the pore areaand thread the molecules inside the pore in a highly efficient manner.Thus extremely small copy numbers <strong>of</strong> the target DNA are requiredfor analyses†, circumventing the need for costly and time-consumingtarget amplification. One <strong>of</strong> the major bottlenecks for the realization<strong>of</strong> a viable nanopore-based DNA sequencing has been the ability tosimultaneously read the electrical signals from hundreds to thousands<strong>of</strong> nanopores densely fabricated on sub-millimeter size silicon chip. Toaddress this issue we develop an extremely high throughput singlemoleculeDNA sequencing technique, which employs optical, widefieldreadout from DNA molecules electrically mobilized throughthe nanopores. Our method consists <strong>of</strong> two steps: First, target DNAmolecules are converted according to pre-determined code, which isrecognized by molecular beacons with four types <strong>of</strong> fluorophores (eachuniquely corresponding to one <strong>of</strong> the four DNA bases). Solid-statenanopores are then used to sequentially strip <strong>of</strong>f the beacons, leadingto a series <strong>of</strong> photon bursts that can be detected with a custom mademicroscope. Notably the method circumvents the use <strong>of</strong> enzymes in thereadout stage, and is thus not affected by their limited processivity andlifetime. Here we demonstrate the feasibility <strong>of</strong> this method using a twocolor model system, and show for the first time, individual nucleotiderecognition from multiple nanopores simultaneously‡, allowingstraightforward parallelization <strong>of</strong> our system to nanopore arrays. †.Wanunu, M., W. Morrison, Y. Rabin, A. Y. Grosberg, and A. Meller. 2010.Electrostatic Focusing <strong>of</strong> Unlabeled DNA into Nanoscale Pores using aSalt Gradient. Nature Nanotechnology 5:160-165. ‡. McNally, B., A.Singer, Z. Yu, Y. Sun, Z. Weng, and A. Meller. 2010. Optical Recognition<strong>of</strong> Converted DNA Nucleotides for Single-Molecule DNA SequencingUsing Nanopore Arrays. Nano Letters 10:2237-2244.Scientific SessionAbstracts<strong>ABRF</strong> <strong>2011</strong> — Technologies to Enable Personalized Medicine • 41
- Page 7 and 8: Meeting SponsorsThe Association of
- Page 9 and 10: Registration ServicesRegistration S
- Page 12 and 13: Presenter InformationPresenter Info
- Page 14 and 15: AwardsABRF Annual Award for Outstan
- Page 16 and 17: ABRF Robert A. Welch Outstanding Re
- Page 18 and 19: Program-at-a-GlanceProgram-at-a-Gla
- Page 20 and 21: Sunday, February 20 (Continued)(S1-
- Page 22 and 23: Sunday, February 20 (Continued)(W3-
- Page 24 and 25: Monday, February 21 (Continued)12:0
- Page 26 and 27: Tuesday, February 22 (Continued)9:0
- Page 28 and 29: Tuesday, February 22 (Continued)(W1
- Page 30 and 31: Satellite Educational Workshop Spon
- Page 32 and 33: SW2: An Introduction to Metabolomic
- Page 34 and 35: SW4: Lean Management in Core Facili
- Page 36 and 37: Plenary Session AbstractsPlenary Se
- Page 38 and 39: Scientific Session AbstractsScienti
- Page 40 and 41: Scientific SessionAbstractsstatisti
- Page 44 and 45: Scientific SessionAbstracts(S7-2) S
- Page 46 and 47: Workshop Session AbstractsWorkshop
- Page 48 and 49: Workshop SessionAbstractscase incor
- Page 50 and 51: Workshop SessionAbstracts(W7) Cellu
- Page 52 and 53: Workshop SessionAbstractsresolution
- Page 54 and 55: (W14-3) Galaxy Next Generation Sequ
- Page 56 and 57: Research Group Presentation Abstrac
- Page 58 and 59: Research GroupPresentation Abstract
- Page 60 and 61: Research GroupPresentation Abstract
- Page 62 and 63: Poster Session Abstracts**ABRF Post
- Page 64 and 65: Poster Abstractsglobal gene express
- Page 66 and 67: Poster Abstractsdiscussions of cont
- Page 68 and 69: Poster Abstractspossible applicatio
- Page 70 and 71: Poster Abstracts129 Bravo Automated
- Page 73 and 74: distribution. Additionally, we show
- Page 75 and 76: Illumina Genome Analyzer. PicoPlex
- Page 77 and 78: uniformity, reproducibility of enri
- Page 79 and 80: kits for Tempus TM and PAXgene® st
- Page 81 and 82: 165 Benchmarking miRNA ExpressionLe
- Page 83 and 84: Phoenix crystallography robot and a
- Page 85 and 86: Laboratory is located next door and
- Page 87 and 88: data of a Thermo Orbitrap instrumen
- Page 89 and 90: **197 Multi-Functional Superparamag
- Page 91 and 92: pattern of N-linked glycans in form
- Page 93 and 94:
advantage of using two separate 15
- Page 95 and 96:
allowed absolute quantitation. Comp
- Page 97 and 98:
levels between the animal groups. A
- Page 99 and 100:
MuDPIT, and off-gel electrophoresis
- Page 101 and 102:
237 Multi-Site Assessment of Proteo
- Page 103 and 104:
with good limits of quantification
- Page 105 and 106:
249 Rapid Monoclonal Antibody Glyca
- Page 107 and 108:
OOng, J. - 117PPatel, S. - 169Peake
- Page 109 and 110:
AnaSpec, Eurogentec Group Booth 417
- Page 111 and 112:
FASEB MARC Program Booth 4169650 Ro
- Page 113 and 114:
IntegenX, Inc. Booth 2015720 Stoner
- Page 115 and 116:
Nonlinear Dynamics Booth 1052530 Me
- Page 117 and 118:
Roche Applied Science Booth 5009115
- Page 119 and 120:
Exhibit Hall FloorplanGrand Oaks Ba
- Page 121 and 122:
Exhibitor List in Booth OrderExhibi
- Page 123 and 124:
This workshop will present ways to
- Page 125 and 126:
Grand Oaks Ballroom,Rooms P&Q, Leve
- Page 127 and 128:
NotesNotesABRF 2011 — Technologie
- Page 129 and 130:
Tuesday, February 22 — 12:00 pm -
- Page 131 and 132:
MARCH 16-20, 2012 • DISNEY’S CO