13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

404 CHEUNG ET AL.netic analysis. Expression phenotypes are good modelsfor developing molecular and statistical tools for analyzingquantitative traits more generally. In comparison toother human phenotypes, it is relatively easy to measurea large number <strong>of</strong> expression phenotypes at once in thesame person. <strong>The</strong>re are various mechanisms that mightregulate gene expression in humans. In some cases, theexpression level <strong>of</strong> a gene is regulated only by a cis-actingelement. In others, the expression level is regulated byseveral genes that act in trans or a combination <strong>of</strong> regulatoryelements. <strong>The</strong>se levels <strong>of</strong> complexity make expressionphenotypes good models for the many componentsunderlying complex traits and diseases in humans.ANALYSIS OF NATURAL VARIATION INHUMAN GENE EXPRESSIONOur goals are (1) to define the extent <strong>of</strong> variation ingene expression in normal individuals and (2) to determinethe genetic basis <strong>of</strong> that variation. We analyzed theexpression levels <strong>of</strong> genes in lymphoblastoid cells from50 unrelated individuals from the Utah pedigrees <strong>of</strong> theCentre d’Etude du Polymorphisme Humain (CEPH) usingmicroarrays (Affymetrix Human <strong>Genom</strong>e FocusChip) that contain about 8500 human genes. Expressionpr<strong>of</strong>iling for each sample was performed in duplicate. Asa measure <strong>of</strong> variation in gene expression, for each gene,we calculated the variance ratio (variance among individualsdivided by mean variance between microarray replicateson same individual). This allows us to characterizevariation among individuals relative to measurementnoise (Cheung et al. 2003). We ranked the genes by varianceratio and focused on those that are the most variable(i.e., with the highest variance ratio) since they are likelyto be more amenable to genetic dissection. <strong>The</strong>n, we measuredthe expression levels <strong>of</strong> some <strong>of</strong> these “variable”genes in individuals in large families and also in sets <strong>of</strong>monozygotic twin pairs.the differences in expression levels <strong>of</strong> 15 highly variablegenes among unrelated individuals to those betweenmonozygotic (MZ) twins. We found that in these transformedcells, the variance within MZ twin pairs, as a fraction<strong>of</strong> variance among unrelated individuals, rangedfrom 0.002 to 0.57 (mean 0.19, median 0.19). <strong>The</strong>se findingsindicated that the expression levels <strong>of</strong> genes arehighly correlated among monozygotic twins compared tounrelated individuals. Differences in expression levels <strong>of</strong>genes in lymphoblastoid cells reflect germ-line geneticdifferences, despite the transformation process.VARIABLE GENESWe analyzed data for ~3800 (45%) <strong>of</strong> the 8500 genes,restricting attention to those that were expressed in atleast 10 out <strong>of</strong> the 50 unrelated individuals who werestudied. Among these genes, the variance ratios rangedfrom 0.2 to 48.9 (mean 2.6, median 1.6). For most <strong>of</strong> thegenes, the variance <strong>of</strong> expression levels between individualsis higher than the variance between microarray replicates(Fig. 1). This shows the reproducibility <strong>of</strong> the microarraydata. In our data, some genes have varianceratios that are approximately 1; for these genes, there isno evidence for meaningful variation among individuals.<strong>The</strong>refore, in order to maximize the chance to detect geneticdifferences that account for variation in gene expression,we focus on the genes with variance ratio appreciablygreater than 1. We expect that these are thegenes where variation is most likely to be biologicallymeaningful, and therefore potentially due to genetic differences.<strong>The</strong> points in red in Figure 1 represent the genes(top 5%) with the highest variance ratios.We examined the genomic location <strong>of</strong> these “variable”genes (Fig. 2). We found that they were not clustered inthe genome; instead, they mapped to many sites acrossthe genome. Figure 2 shows the chromosomal locations<strong>of</strong> the 50 most variable genes. For these genes, the rangeLYMPHOBLASTOID CELLS AS RNA SOURCEFOR GENE EXPRESSION ANALYSISWe have chosen to use lymphoblastoid cells in our studyfor several reasons. First, we need an RNA source that canbe obtained from a large number <strong>of</strong> normal individuals inlarge pedigrees. Immortalized lymphocytes (transformedby EBV) are available from all the members <strong>of</strong> the CEPHpedigrees. <strong>The</strong>se are exceptionally large three-generationfamilies that have been studied extensively. Genotypes forgenetic mapping are available for many <strong>of</strong> these families,which will facilitate our effort to map the genetic determinants<strong>of</strong> variation in gene expression. Second, lymphoblastoidcells from our study subjects can be grown undernear-identical conditions. It has been shown that largedifferences are found in expression levels <strong>of</strong> genes that arestudied on different occasions in fresh blood samples fromthe same individual (Whitney et al. 2003). With the lymphoblastoidcells, we can control the growth conditions inorder to minimize environmental variation.We were concerned about how transformation may affectgene expression. To examine this issue, we comparedFigure 1. Scatter plot <strong>of</strong> variance in expression levels betweenindividuals and between microarray replicates for ~3800 genes.<strong>The</strong> genes with the highest variance ratios (top 5%) are highlightedin red. <strong>The</strong> solid line indicates a variance ratio <strong>of</strong> 1.0.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!