13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Harvesting the <strong>Genom</strong>e’s Bounty: Integrative <strong>Genom</strong>icsP. JORGENSEN,* † B.-J. BREITKREUTZ,* K. BREITKREUTZ,* C. STARK,* G. LIU,* M. COOK,* †J. SHAROM,* † J.L. NISHIKAWA,* † T. KETELA, D. BELLOWS,* A. BREITKREUTZ, † I. RUPES,*L. BOUCHER,* † D. DEWAR,* M. VO,* M. ANGELI,* T. REGULY,* A. TONG, †B. ANDREWS, † C. BOONE, † AND M. TYERS* †*Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada M5G 1X5; † Department<strong>of</strong> Medical Genetics and Microbiology, University <strong>of</strong> Toronto, Toronto, Ontario, Canada, M4G 1A8; and Bantingand Best Department <strong>of</strong> Medical Research, University <strong>of</strong> Toronto, Toronto, Ontario, Canada M5G 1L6In this post-genomic era, we face the daunting challenge<strong>of</strong> assigning function to tens <strong>of</strong> thousands <strong>of</strong> uncharacterizedopen reading frames. Furthermore, the physical andfunctional connections between gene products must beelucidated if we are to understand how the linear DNA sequenceencodes dynamic cellular function. An unexpectedresult <strong>of</strong> the Human <strong>Genom</strong>e Project is the low number <strong>of</strong>predicted human genes, roughly fivefold more than in single-celledyeasts and approximately double that in flies orworms (Lander et al. 2001; Venter et al. 2001). Increasedconnectivity between this relatively constant number <strong>of</strong>genes may underlie the massive increase in system-levelcomplexity that distinguishes yeast from humans.<strong>Genom</strong>e-wide approaches to discovery <strong>of</strong> gene functionnow include systematic analysis <strong>of</strong> genetic interactions,protein interactions, genome-wide expression pr<strong>of</strong>iles, andmutant phenotypes. More than any single approach, each<strong>of</strong> which is subject to caveats in interpretation and reproducibility,the intersection <strong>of</strong> orthogonal genome-scaledata sets provides a robust means to interrogate gene function.Here, we summarize advanced genome-scale methodsfor biological discovery in the budding yeast Saccharomycescerevisiae, many <strong>of</strong> which will be transportable tointerrogation <strong>of</strong> human gene function.OVERVIEW OF FUNCTIONAL GENOMICSTOOLS IN YEAST<strong>The</strong> S. cerevisiae genome sequence, completed in 1996,was the first reported for an autonomous life form. Sincethen, the powerful molecular genetics <strong>of</strong> the budding yeastsystem has afforded the first detailed analysis <strong>of</strong> a genomeand its encoded proteins. Even though few yeast geneshave introns, reliable open reading frame (ORF) predictionhas proven non-trivial, due to both sequencing errorsand a preponderance <strong>of</strong> short ORFs. Recently, the geneannotation problem has been effectively solved by comparativegenome sequence analysis <strong>of</strong> S. cerevisiae andthree closely related yeast species, S. paradoxus, S.mikatae, and S. bayanus (Kellis et al. 2003). Through thisapproach, 5,726 conserved ORFs have been annotated asbona fide genes, including 148 new ORFs <strong>of</strong> less than 100residues. Notably, 500 <strong>of</strong> the originally predicted ORFsfail to meet the criterion <strong>of</strong> conservation and are likelyspurious. Comparative genome analysis has also pinpointedconserved sequence elements in promoter regions,which are presumed to correspond to transcription factorbindingsites (Cliften et al. 2003; Kellis et al. 2003).<strong>The</strong> yeast genome sequence has spawned a host <strong>of</strong> systematicapproaches, many <strong>of</strong> which have since been appliedto other model systems. For example, the genomesequence allowed the first genome-wide transcriptionalpr<strong>of</strong>iles to be recorded by DNA microarray technology(DeRisi et al. 1997). Public databases now contain expressiondata for over 1,000 different experimental conditions(Sherlock et al. 2001). This sizable compendium<strong>of</strong> genome-wide expression data allows many gene functionsand small-molecule targets to be assigned simply byclustering <strong>of</strong> gene expression pr<strong>of</strong>iles (Hughes et al.2000). Insight into transcriptional regulatory circuits hasbeen further enhanced by the development <strong>of</strong> array-basedchromatin-immunoprecipitation methods that in principleallow identification <strong>of</strong> all DNA sequence elements boundby transcription factors and other chromatin-associatedproteins (Ren et al. 2000; Iyer et al. 2001).<strong>The</strong> ease with which genes can be deleted or otherwisemanipulated by homologous recombination is a key attribute<strong>of</strong> budding yeast. This feature was exploited by aninternational consortium <strong>of</strong> laboratories to systematicallydelete ~95% <strong>of</strong> all predicted yeast genes (Giaever et al.2002). <strong>The</strong> resulting collection <strong>of</strong> ~6,000 gene deletionstrains has allowed numerous high-throughput approachesfor characterization <strong>of</strong> gene function. Phenotypicanalysis using the deletion set is augmented by animaginative design feature, namely the inclusion <strong>of</strong>unique 20-mer oligonucleotide tags that flank each PCRdeletion cassette, aptly termed barcodes (Winzeler et al.1999). <strong>The</strong> relative abundance <strong>of</strong> all gene deletion strainsin a population pool can then be determined by PCR amplification<strong>of</strong> all barcodes from genomic DNA usingcommon primers, followed by hybridization <strong>of</strong> labeledPCR products to a microarray <strong>of</strong> all ~12,000 barcode sequences.Competitive growth <strong>of</strong> the deletion pool underany given selection thus identifies gene deletions thatconfer sensitivity by reduction in the cognate barcode signals.This method has been used to quantify strain growthdefects and strain sensitivity to a variety <strong>of</strong> stress conditions,including osmotic stress, DNA damage, and bioac-Cold Spring Harbor Symposia on Quantitative Biology, Volume LXVIII. © 2003 Cold Spring Harbor Laboratory Press 0-87969-709-1/04. 431

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!