02.05.2013 Views

Evolution__3rd_Edition

Evolution__3rd_Edition

Evolution__3rd_Edition

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

..<br />

Figure 19.1<br />

The history of the human gene<br />

set. The analysis is based on<br />

protein-coding genes only. The<br />

figure shows the percentage of<br />

human genes that are shared<br />

with other taxa. The fraction<br />

of human genes that originated<br />

on each branch is shown in<br />

the tree, and the total percent<br />

similarity of genes is shown<br />

across the top. Thus, 32%<br />

of modern human genes<br />

originated after eukaryotes<br />

diverged from prokaryotes (but<br />

before animals diverged from<br />

the rest of the eukaryotes), and<br />

53% of human genes are shared<br />

with all other eukaryotes. The<br />

total at the top is approximately<br />

equal to the sum of the<br />

percentages in the tree up to<br />

that branch (for example, 53%<br />

at the top equals 21 + 32% in<br />

the tree). Many modern human<br />

genes originated by duplication<br />

of ancestral genes. Thus, two<br />

genes in humans may be similar<br />

to one gene in yeast. If that gene<br />

is absent from bacteria, both<br />

the human genes are shown<br />

as “originating” in the branch<br />

between prokaryotes and<br />

eukaryotes even though one of<br />

the copies arose by duplication<br />

later. Alternative splicing is<br />

ignored. Non-protein-coding<br />

genes and non-coding DNA<br />

are excluded. From data in<br />

International Human Genome<br />

Sequencing Consortium<br />

(2001).<br />

Time (approx) (Myr)<br />

500<br />

1,500<br />

2,000–2,500<br />

3,500<br />

1%<br />

22%<br />

CHAPTER 19 / <strong>Evolution</strong>ary Genomics 559<br />

99% 75% 53% 21% % genes<br />

Humans Vertebrates<br />

Animals Eukaryotes<br />

(non-vertebrates) (non-animals) Prokaryotes<br />

shared with<br />

humans<br />

The history of our gene set, as described here, is very incomplete and uncertain. It is<br />

incomplete because it is mainly based on comparisons with a small number of other<br />

species, such as the mouse, fruitfly, worm, yeast, and Escherichia coli. The history will<br />

become better known as the genomes of more species are sequenced, and we can compare<br />

human DNA with DNA from a greater range of relatives. The history is also uncertain<br />

for several reasons. The methods used to recognize genes in raw DNA sequence<br />

data are subject to error. Genes may have been overlooked or wrongly compared. Also,<br />

the analysis in Figure 19.1 takes no account of “alternative splicing” (Section 2.2, p. 24).<br />

More than one protein can be read from a single gene, but the analysis considered only<br />

one protein per gene.<br />

Thirdly, homology was inferred from relative sequence similarity. A human protein<br />

was taken to be a homolog of a protein in another species, if the human protein was<br />

much more similar to that other protein than to randomly picked proteins. For conserved,<br />

slowly evolving genes this criterion should give accurate, if approximate,<br />

results. But for genes that have evolved steadily over time, the results may be misleading.<br />

For instance, humans share a more recent common ancestor with all eukaryotes<br />

then with all prokaryotes. A gene in a prokaryote has been evolving away from us<br />

for longer, and may be less likely to be recognized as a homolog of a human gene, than a<br />

gene in a eukaryote. For this reason, the fraction of genes shown as homologous in<br />

Figure 19.1 could be biased by the technique used to recognize homologies. However,<br />

improved criteria of homology can be developed, together with other improvements<br />

in methods and data. We should then be able to flesh out the currently skeletal history<br />

of human DNA.<br />

19.3 The history of duplications can be inferred in a genomic<br />

sequence<br />

24%<br />

Genomes, as a whole or in part, change size during evolution by means of duplications<br />

and deletions (Section 2.5, p. 30). A duplication or deletion will initially be rare in<br />

the population; it may arise as a unique mutation. Its frequency may then increase by<br />

natural selection or random drift. Once a duplication or deletion has spread through<br />

32%<br />

21%

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!