13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

EVOLUTION OF EUKARYOTIC GENES AND INTRONS 295bidopsis, and minimal in the microsporidian (Fig. 1). Anotable difference was observed in the representation <strong>of</strong>eukaryotic genomes in KOGs with different numbers <strong>of</strong>species. <strong>The</strong> three unicellular organisms are representedlargely in highly conserved seven- or six-species KOGs;in contrast, in animals and Arabidopsis, a much largerfraction <strong>of</strong> the genes is accounted for by LSEs and byKOGs that include three or four genomes, e.g., animalspecificones (Fig. 1).<strong>The</strong> phyletic patterns <strong>of</strong> KOGs reveal a substantial,conserved eukaryotic gene core, but also notable diversity(Fig. 2). <strong>The</strong> “pan-eukaryotic” genes, which are representedin each <strong>of</strong> the seven analyzed genomes, comprise~20% <strong>of</strong> the KOGs, and approximately the samenumber <strong>of</strong> KOGs include all species except for the microsporidian,an intracellular parasite with a highly degradedgenome (Katinka et al. 2001). Among the remainingKOGs, a large fraction consists <strong>of</strong> representatives <strong>of</strong>the three analyzed animal species (worm, fly, and human),but ~30% are KOGs with unexpected patterns, e.g.,one animal, one plant, and one fungal species (seehttp://www.ncbi.nlm.nih.gov/COG/new/shokog.cgi).During manual curation <strong>of</strong> the KOGs, those with unexpectedpatterns were specifically scrutinized to identifypotential highly diverged members from one or more <strong>of</strong>the analyzed genomes, and the KOGs were modifiedwhen such diverged orthologs were detected. Some <strong>of</strong>these unexpected patterns probably indicate that a gene isstill missing in the analyzed set <strong>of</strong> protein sequences fromone or more <strong>of</strong> the included eukaryotic species. Largely,however, the unexpected phyletic patterns seem to reflectthe extensive, lineage-specific gene loss that is apparentlycharacteristic <strong>of</strong> eukaryotic evolution.A PARSIMONIOUS SCENARIO OF GENE LOSSAND EMERGENCE IN EUKARYOTICEVOLUTION AND RECONSTRUCTION OFANCESTRAL EUKARYOTIC GENE SETSAssuming a particular species tree topology, methods<strong>of</strong> evolutionary parsimony analysis can be employed toconstruct a parsimonious scenario <strong>of</strong> evolution; i.e., map-ping <strong>of</strong> different types <strong>of</strong> evolutionary events onto thebranches <strong>of</strong> the tree such that the total number <strong>of</strong> theevents is minimal. As discussed above, with prokaryotes,the problem is confounded by the fact that both lineagespecificgene loss and HGT apparently have made majorcontributions to genome evolution, with the relative rates<strong>of</strong> these processes remaining unknown (Snel et al. 2002;Mirkin et al. 2003). Since HGT between major lineages<strong>of</strong> eukaryotes apparently can be safely disregarded, anunambiguous parsimonious scenario including only geneloss and emergence <strong>of</strong> new genes as elementary eventscan be constructed. <strong>The</strong> phylogenetic tree <strong>of</strong> the eukaryoticcrown group seems to have been established withnear complete confidence. In particular, some conflictingobservations notwithstanding, the majority <strong>of</strong> phylogeneticstudies point to an animal–fungal clade, grouping <strong>of</strong>microsporidia with the fungi, and a coelomate (chordate–arthropod)clade among the animals (Blair et al.2002; Hedges 2002; Y.I. Wolf et al., unpubl.). Assumingthis tree topology and treating the phyletic pattern <strong>of</strong> eachKOG as a string <strong>of</strong> binary characters (1 for the presence<strong>of</strong> the given species and 0 for its absence in the givenKOG), the parsimonious scenario <strong>of</strong> gene loss and emergenceduring the evolution <strong>of</strong> the eukaryotic crown groupwas constructed. For this reconstruction, the Dollo parsimonyapproach was adopted (Farris 1977). Under this approach,gene loss is considered irreversible; i.e., a gene (aKOG member) can be lost independently in several evolutionarylineages but cannot be regained. <strong>The</strong> use <strong>of</strong> thisassumption is justified by the negligible rate <strong>of</strong> HGT betweeneukaryotes (the Dollo approach is not valid for thereconstruction <strong>of</strong> prokaryotic ancestors).In the resulting scenario, each branch was associatedwith a unique number <strong>of</strong> losses and gains, with the exception<strong>of</strong> the plant branch and the branch leading to thecommon ancestor <strong>of</strong> the fungi–microsporidial clade andthe animals, to which gene losses could not be assignedwith the current set <strong>of</strong> genomes (Fig. 3). Undoubtedly,once genomes <strong>of</strong> early-branching eukaryotes are included,gene loss associated with these branches will becomeapparent. <strong>The</strong> reconstructed scenario includes massivegene loss in the fungal clade, with additional85819472711881421109Figure 2. <strong>The</strong> phyletic patterns <strong>of</strong> KOGs. (All-Ec) KOGs represented in all analyzed genomes with the exception <strong>of</strong> Encephalitozooncuniculi; (All animals, All fungi) KOGs represented exclusively in the three animal species or the two fungal species, respectively.921186AllAll-EcAnimals+FungiPlant+fungiPlant+animalsAll animalsAll fungiOther patterns

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!