13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

HUMAN MUTATIONAL PROFILING 25Another aspect <strong>of</strong> DNA sequencing is focused on obtainingthe highest fidelity <strong>of</strong> sequence assignment, orbase-calling, to the resulting data. Specifically, base-callingalgorithms must be tuned to detect polymorphisms andflag them for downstream analysis. This presents somewhat<strong>of</strong> a challenge, since polymorphic sites typically exhibittwo unique characteristics that are not recognizedwell by most DNA sequence base-calling algorithms.Namely, a polymorphic site exhibits a significant drop inpeak height relative to neighboring (non-polymorphic)peaks, and contains a smaller underlying peak producedby the second allele. Although the number <strong>of</strong> commercials<strong>of</strong>tware packages aimed at polymorphism detection is increasing,we have used the publicly available polyphredpackage <strong>of</strong> Nickerson (Nickerson et al. 1997) to identifyand tag polymorphic sites in mutational pr<strong>of</strong>iling data.This algorithm has its functions fully integrated within thephred/phrap/consed s<strong>of</strong>tware packages (Ewing and Green1998; Ewing et al. 1998; Gordon et al. 1998) that we haveused in the production, assembly, and viewing <strong>of</strong> de novogenome sequence data. As such, we have the ability to tagpolymorphic sites and assemble (via phrap) the componentreads that constitute the exonic (or full gene) sequencesfor a given patient sample. Large-scale comparisons<strong>of</strong> cases versus controls or normal versus tumorsamples can then be examined using the consed sequencealignment viewer, in which the polyphred tags can bereadily found and the underlying trace data examined tovalidate the polyphred assignment. Ultimately, these datacan be combined into a visual genotypes output (Nickersonet al. 1998; Rieder et al. 1998) that enables large-scaleexamination and clustering <strong>of</strong> polymorphic sites for multiplegenes in multiple patient samples.Underlying all <strong>of</strong> the data collection aspects <strong>of</strong> a mutationalpr<strong>of</strong>iling project is the crucial aspect <strong>of</strong> sampletracking, which must be perfect and unequivocal in orderfor results and conclusions to be accurate and meaningful.In our laboratory, we rely on an Oracle database and accompanyingschema to enable barcode-based sampletracking <strong>of</strong> patient samples and associated data throughoutthe mutational pr<strong>of</strong>iling pipeline.MUTATIONAL PROFILING PROJECTSAT WASHINGTON UNIVERSITYPulmonary Surfactant Protein B DeficiencyNewborn respiratory distress syndrome (N-RDS) is themost frequent respiratory cause <strong>of</strong> death and morbidity ininfants under one year <strong>of</strong> age in the United States (Guyeret al. 1998). N-RDS has most commonly been attributedin premature infants to pulmonary surfactant deficiencydue to developmental immaturity <strong>of</strong> surfactant production(Avery and Mead 1959; Whitsett and Stahlman1998), since type II pneumocytes do not appear prior to32–34 weeks <strong>of</strong> gestation and lack the ability to producefunctional surfactant. Surfactant replacement therapy hasbeen associated with a decline in N-RDS-associated mortality,but not in long-term respiratory morbidity, hencerevealing the contribution <strong>of</strong> genetic causes <strong>of</strong> N-RDS ininfancy. Genetic disruption <strong>of</strong> surfactant protein B geneexpression has provided unambiguous human and murinerespiratory phenotypes in newborn infants and pups. Surfactantprotein B deficiency was the first identified geneticcause <strong>of</strong> lethal N-RDS in infants and was attributedto a mutation resulting from a 1-bp deletion and a 3-bp insertionat codon 121 <strong>of</strong> the gene (121ins2) (Nogee et al.1993). This mutation creates a new SfuI restriction site,allowing rapid detection <strong>of</strong> the mutation, and leads to atranslation stop signal at codon 214. A truncated, unstablesurfactant protein B transcript results, but no proteinis synthesized (Nogee et al. 1994; Beers et al. 2000). <strong>The</strong>clinical phenotype <strong>of</strong> infants homozygous for the 121ins2mutation is consistent: <strong>The</strong>y are born at full-term, developRDS within the first 12–24 hours after birth, haveno associated extrapulmonary organ dysfunction, andwithout lung transplantation will expire within the first1–6 months <strong>of</strong> life (Hamvas et al. 1997; Nogee 1997).Studies <strong>of</strong> compound heterozygotes <strong>of</strong> 121ins2 indicatethat a minimum net production <strong>of</strong> surfactant protein B between10% and 50% is essential for survival (Ballard etal. 1995; Yusen et al. 1999).<strong>The</strong> DNA sequence <strong>of</strong> the surfactant protein B geneand its regulatory regions span 11 kb <strong>of</strong> the humangenome and are contained in 11 exons, with exons 6 and7 encoding the mature protein and exon 11 providing a3´-untranslated region (Whitsett et al. 1995). <strong>The</strong> mutationalspectrum <strong>of</strong> this gene in affecteds has been pr<strong>of</strong>iledby previous studies, with the 121ins2 mutation found on~60% <strong>of</strong> chromosomes (Nogee et al. 2000). In addition tothis predominant mutation, 16 exonic mutations havebeen characterized. Studies <strong>of</strong> murine knockout or transgeniclineages with disrupted or altered surfactant proteinB gene expression have mimicked many <strong>of</strong> thehistopathologic features <strong>of</strong> the disease in humans, suggestingthat mutations resulting in altered processing,folding, intracellular itinerary, or phospholipid interactionmay be clinically significant. <strong>The</strong>refore, we have initiateda project to perform mutational pr<strong>of</strong>iling across thisgene in multiple patient samples, in an effort to bettercharacterize those mutations in and around the surfactantprotein B gene that appear to predispose infants to RDS.Because <strong>of</strong> the relatively compact nature <strong>of</strong> the SP-Bgene, and our desire to understand both coding and noncodingmutational pr<strong>of</strong>iles for affecteds in this study, wetook an approach that focused on the entirety <strong>of</strong> the genesequence, including regions just 5´ <strong>of</strong> the first exon andthe 3´ UTR, represented as ~500-bp PCR amplicons havinga small amount <strong>of</strong> sequence overlap with adjoiningproducts. Most <strong>of</strong> the patient DNAs used in these studiesare obtained from Chelex-based extraction <strong>of</strong> dried bloodspots taken at birth. <strong>The</strong> patient samples designated forthis study represent large numbers <strong>of</strong> patients (~10,000each) from a diverse geographical collection, includingthe state <strong>of</strong> Missouri; Cape Town, South Africa; Seoul,South Korea; and Oslo, Norway. Comparisons <strong>of</strong> the mutationalpr<strong>of</strong>iles for these distinct geographic and ethnicgroups, when correlated to phenotype, will enable an estimation<strong>of</strong> ethnic frequency <strong>of</strong> the predominant mutationsubtypes, including 121ins2.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!