13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

ANALYSIS OF HUMAN PROMOTERS 221Figure 3. Luciferase activity <strong>of</strong> predicted promoters in a 120-kb region <strong>of</strong> human ch19q13 (provided by Lisa Stubbs).ing region (which can be located by EST/poly(A) mapping).<strong>The</strong>re are also many distal enhancers/silencers/boundary elements that are the most difficult to find becausethey are so far away from the target genes. Even ifthey are found, linking to the correct target genes is stillno easy task. We are focusing on the proximal promoterregion for cis-regulatory element discovery; many <strong>of</strong> themethods may also be applied to other regulatory regionsonce they are approximately localized (e.g., by comparativegenomic analysis, by DNase hypersensitivity mapping,or by enhancer trapping technologies).Computation-<strong>The</strong>n-Validation ParadigmTraditionally, identification <strong>of</strong> a cis-regulatory elementis very laborious: collect known binding sites, build consensusor weight matrix, and search for new loci. One cannotdiscover novel sites in this way. To study human cellcycle regulation, we have developed E2F SiteScan basedon a genetic algorithm trained on known sites in TRANS-FAC and have scanned ~5,000 promoters in the publicdatabase to identify more than 300 E2F targets, many <strong>of</strong>which were also validated by the ChIP-PCR method (Kelet al. 2001). Since the E2F motif was built mainly fromknown cell cycle genes, they may be biased, as E2F alsoplays important roles in other biological pathways (such asapoptosis and DNA repair). By analyzing promoters fromChIP-PCR top candidates, we were able to identify novelE2F targets that do not have the conventional binding motif(Weinmann et al. 2001). However, the scope with PCRis still very limited. When large-scale genome-wide dataand technologies become available, one will be able tostudy TFBSs in the whole genome together with their transcriptionalreadouts. It is expected that computational approacheswill become more indispensable and will playmore important roles in the future (Zhang 2002b,c).Large-scale Gene Expression AnalysisDNA microarray gene expression has become thewidely used method for studying gene regulation. It providesa direct readout <strong>of</strong> cellular transcriptional programs.Interpretation <strong>of</strong> gene expression patterns by ciselements and trans factors, or conversely, by reconstruction<strong>of</strong> regulatory circuits from transcriptional responses,is the main challenge in the 21st century (Zhang 1999a;Banerjee and Zhang 2002). Using cluster analysis followedby motif searching <strong>of</strong> promoters <strong>of</strong> coregulatedgenes, we were quite successful in identification <strong>of</strong> cis elementsinvolved in yeast cell cycle regulation (Spellmanet al. 1998; Zhang 1999b). By combining functional information,such as MIPS (Zhu and Zhang 2000) or GO

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!