Phylogeny and molecular evolution of green algae - Phycology ...

More documents

Recommendations

Info

INTRODUCTION 15 Figure 9. Flow chart for accurate phylogenetic reconstruction. A. Phylogenetic reconstruction. B. Removal of fast-evolving sites. Selection of the optimal partitioning strategy and model A great number of models of sequence evolution have been described, ranging from simple models to complex models incorporating a lot of parameters. To reconstruct an accurate phylogenetic tree it is important to select a model of sequence evolution that approximates the evolutionary history of genes under study. A number of criteria have been developed to evaluate the fit of the different models to the data. The Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) are two of these criteria that can be used for selection of the optimal partitioning strategy and evolutionary model (Figure 9A). Whereas AIC only penalizes for the number of model parameters, BIC also incorporates alignment length and thus penalizes a situation in which many parameters have to be estimated from a small dataset. In other words for the same dataset, AIC will prefer a more complex model than BIC. Because Bayesian analysis appears to be more sensitive to model underspecifications than ML, some authors have suggested that AIC scores can be used to choose a complex model for Bayesian analysis and BIC to choose a less complex model for ML analysis (Verbruggen and Theriot 2008). AIC and BIC calculation starts from a guide tree which is inferred with a fast distance based method (e.g. NJ) or a fast ML search under a simple model (e.g. PhyML,
16 CHAPTER 1 Treefinder). In the second step log likelihoods of the guide tree under different partitioning strategies and models are calculated. Subsequently, the corresponding AIC or BIC scores are calculated and compared. The condition with the lowest AIC and/or BIC score is chosen for phylogenetic analysis. Alternatively, Bayes factors (Nylander et al. 2004) can be used to compare different partitioning strategies and models. For each tested condition a separate Bayesian analyses has to be run which implies high computational times. This makes it unrealistic to compare many partitioning strategies and models in a Bayesian framework. Complex models of sequence evolution The secondary structure of ribosomal RNA consists of loops and stems. The nucleotides in the stem regions form base pairs and are interdependent because a change on one side of the stem has to be compensated in the other side of stem to avoid malfunction of the molecule. Since models of sequence evolution have to approach real evolution as close by as possible, it is recommended to incorporate this site interdependence in the model. This can be done by partitioning the ribosomal RNA into loops and stems and using a doublet model for the stem regions (Schöniger and Von Haeseler 1994). However, the use of a doublet model is computational demanding. Instead of partitioning protein coding genes into codon positions, a codon substitution model can be applied. In this model, nucleotide triplets are considered as a single character and changes from one triplet to another one are considered taking into account that some changes are more likely than others (e.g. synonymous versus non-synonymous substitution). Although codon substitution models are a more realistic approximation of protein sequence evolution than codon position models, they come with a very high computational cost, hindering their use for large datasets (Shapiro et al. 2006). Mixture models Mixture models offer an attractive alternative to data partitioning and applying different models to the partitions. Whereas a partitioned analysis assumes that all sites within a partition arose from a single evolutionary process, mixture models relax this assumption by not expecting any prior partitioning and applying a set of different models to each site in the alignment. The log likelihood of each site is calculated as a weighted sum of the log likelihoods of each model for that site. The model weights correspond to the probability that the site has evolved under the model in question. Mixture models can thus apply different rate matrices to different parts of the dataset without explicitly partitioning it (Pagel and Meade 2004, Venditti et al. 2008). This is an elegant way to incorporate across site heterogeneity in the evolutionary process because it does not require prior knowledge about differences of evolutionary processes between different parts of the dataset and it avoids problems associated with differences of the evolutionary process within partitions that are defined a priori. Although analyses using mixture models outperform analyses based on partitioned datasets, they are restrictively time-consuming for large datasets.
Page 1 and 2: Phylogeny and molecular evolution o
Page 3 and 4: Promotor: Prof. Dr. O. De Clerck (U
Page 5 and 6: Aan de mensen van de plantkunde in
Page 8 and 9: 1 Introduction Algae Algae are a la
Page 10 and 11: Green lineage or Viridiplantae INTR
Page 12 and 13: INTRODUCTION 5 Figure 4. Variation
Page 14 and 15: INTRODUCTION 7 (shared gene losses
Page 16 and 17: INTRODUCTION 9 Sphaeropleales (dire
Page 18 and 19: INTRODUCTION 11 Figure 7. The estim
Page 20 and 21: Tree building methods Maximum likel
Page 24 and 25: Removal of fast-evolving sites INTR
Page 26 and 27: 2 Ancient relationships among green
Page 28 and 29: PHYLOGENY OF GREEN ALGAE 21 environ
Page 30 and 31: PHYLOGENY OF GREEN ALGAE 23 Our phy
Page 32 and 33: PHYLOGENY OF GREEN ALGAE 25 Figure
Page 34 and 35: PHYLOGENY OF GREEN ALGAE 27 to the
Page 36 and 37: PHYLOGENY OF GREEN ALGAE 29 primers
Page 38 and 39: Topological hypothesis testing PHYL
Page 40 and 41: Additional files PHYLOGENY OF GREEN
Page 42 and 43: PHYLOGENY OF GREEN ALGAE 35 Figure
Page 44 and 45: actin G6PI GapA histone OEE1 40S_S9
Page 46 and 47: PHYLOGENY OF GREEN ALGAE 39 atpB rb
Page 48 and 49: PHYLOGENY OF GREEN ALGAE 41 Table S
Page 50 and 51: 3 Gain and loss of elongation facto
Page 52 and 53: GAIN AND LOSS OF ELONGATION FACTOR
Page 60 and 61: Methods Algal strains GAIN AND LOSS
Page 64 and 65: Authors' contributions GAIN AND LOS
Page 66 and 67: Additional file 2 GAIN AND LOSS OF
Page 68 and 69: Additional file 4 GAIN AND LOSS OF
Page 70 and 71: Additional file 6 Table S2. GenBank
Page 72 and 73:
atpB rbcL SSU rDNA EF-1α EFL Chlor
Page 74 and 75:
atpB rbcL SSU rDNA EF-1α EFL choan
Page 76 and 77:
4 Complex phylogenetic distribution
Page 78 and 79:
NON-CANONICAL GENENTIC CODE 71 addi
Page 80 and 81:
Figure 1. The occurrence of a non-c
Page 82 and 83:
NON-CANONICAL GENENTIC CODE 75 Figu
Page 84 and 85:
Multiple independent gains NON-CANO
Page 86:
Acknowledgements NON-CANONICAL GENE
Page 89 and 90:
82 CHAPTER 5 Introduction The genet
Page 91 and 92:
84 CHAPTER 5 The goal of this study
Page 93 and 94:
86 CHAPTER 5
Page 95 and 96:
88 CHAPTER 5 Codon usage bias and G
Page 98 and 99:
6 A multi-locus time-calibrated phy
Page 100 and 101:
A MULTI-LOCUS TIME-CALIBRATED PHYLO
Page 102 and 103:
Page 104 and 105:
Page 106 and 107:
Time-calibrated phylogeny A MULTI-L
Page 108 and 109:
Page 110 and 111:
Page 112 and 113:
Page 114 and 115:
Acknowledgments A MULTI-LOCUS TIME-
Page 116 and 117:
Dichotomosiphon tuberosus AB038487
Page 118 and 119:
Table 3. List of calibration points
Page 120:
Page 123 and 124:
116 CHAPTER 7 Introduction Green al
Page 125 and 126:
118 CHAPTER 7 ulvophycean order Ulo
Page 127 and 128:
120 CHAPTER 7 10). Filaments that w
Page 129 and 130:
122 CHAPTER 7 invaded freshwater ha
Page 131 and 132:
124 CHAPTER 7 filaments, and the po
Page 133 and 134:
126 CHAPTER 7 Type species: Okellya
Page 136 and 137:
8 General discussion This thesis fo
Page 138 and 139:
SSU nrDNA phylogenies GENERAL DISCU
Page 140 and 141:
GENERAL DISCUSSION 133 copy nuclear
Page 142 and 143:
GENERAL DISCUSSION 135 Figure 2. Su
Page 144 and 145:
GENERAL DISCUSSION 137 Our site str
Page 146 and 147:
GENERAL DISCUSSION 139 In the light
Page 148:
GENERAL DISCUSSION 141 genomes (Rok
Page 151 and 152:
144 REFERENCES Bartsch I and Kuhlen
Page 153 and 154:
146 REFERENCES Derelle E, Ferraz C,
Page 155 and 156:
148 REFERENCES Hanyuda T, Wakana I,
Page 157 and 158:
150 REFERENCES Knight RD, Freeland
Page 159 and 160:
152 REFERENCES Mattox K and Stewart
Page 161 and 162:
154 REFERENCES Philippe H, Lartillo
Page 163 and 164:
156 REFERENCES Sanderson MJ. 2002.
Page 165 and 166:
158 REFERENCES Turmel M, Otis C, an
Page 167 and 168:
160 REFERENCES Zechman FW, Theriot
Page 169 and 170:
162 SUMMARY derived from a multinuc
Page 172 and 173:
Samenvatting Groenwieren worden wer
Page 174 and 175:
SAMENVATTING 167 kleine variaties o
show all

Phylogeny and molecular evolution of green algae - Phycology ...

Create successful ePaper yourself

Delete template?

Save as template?