15.12.2012 Views

Bioinformatics, Volume I Data, Sequence Analysis and Evolution

Bioinformatics, Volume I Data, Sequence Analysis and Evolution

Bioinformatics, Volume I Data, Sequence Analysis and Evolution

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

A<br />

C<br />

Phylogenetic Model Evaluation 349<br />

Whelan (98). When the hypotheses are not nested, it is necessary<br />

to use the parametric bootstrap (96, 99), in which case pseudodata<br />

are generated under H 0 . In some instances, permutation<br />

tests may be used instead (100).<br />

The evolution of protein-coding genes <strong>and</strong> RNA-coding<br />

genes is likely to differ due to the structural <strong>and</strong> functional constraints<br />

of their gene products, so to determine whether sites<br />

in an alignment of such genes evolved under independent <strong>and</strong><br />

identical conditions, it is necessary to draw on knowledge of the<br />

structure <strong>and</strong> function of these genes <strong>and</strong> their gene products.<br />

For example, although a protein-coding gene could be regarded<br />

as a sequence of independently evolving sites (Fig. 16.4A), it<br />

would be more appropriate to consider it as a sequence of<br />

independently evolving codons (Fig. 16.4B) or a sequence of independently<br />

evolving codon positions, with sites in the same codon<br />

position evolving under identical <strong>and</strong> independent conditions<br />

Gene ATGAACGAAAATCTGTTCGCTTCATTCATTGCCCCCACAATCCTAGGCCTACCCGCCGCA<br />

Unit<br />

B<br />

Gene ATGAACGAAAATCTGTTCGCTTCATTCATTGCCCCCACAATCCTAGGCCTACCCGCCGCA<br />

Unit<br />

Gene ATGAACGAAAATCTGTTCGCTTCATTCATTGCCCCCACAATCCTAGGCCTACCCGCCGCA<br />

Unit<br />

Category 123123123123123123123123123123123123123123123123123123123123<br />

D<br />

Gene ATGAACGAAAATCTGTTCGCTTCATTCATTGCCCCCACAATCCTAGGCCTACCCGCCGCA<br />

Unit<br />

Category 123123123123456456456456456456456456456456456789789789789789<br />

E<br />

Gene ATGAACGAAAATCTGTTCGCTTCATTCATTGCCCCCACAATCCTAGGCCTACCCGCCGCA<br />

Unit 1<br />

Unit 2<br />

Fig. 16.4. Models used to describe the relationship among sites in protein-coding genes. A protein-coding gene may be<br />

regarded as a sequence of independently evolving units, where each unit is (A) a site, (B) a codon, or (C) a site assigned<br />

its own evolutionary model, depending on its position within a codon. The more complex models include those that<br />

consider (D) information about the gene product’s structure <strong>and</strong> function. (Here, categories 1, 2, <strong>and</strong> 3 correspond to<br />

models assigned to sites within the codons that encode amino acids in one structural domain, categories 4, 5, <strong>and</strong> 6<br />

correspond to models assigned to sites in codons that encode amino acids in another structural domain, <strong>and</strong> so forth.)<br />

(E) Overlapping reading frames. (Here, unit 1 corresponds to one reading frame of one gene whereas unit 2 corresponds<br />

to that of the other gene.)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!