05.02.2013 Views

plant surface microbiology.pdf

plant surface microbiology.pdf

plant surface microbiology.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

7 Statistical analysis<br />

28 Analysis of Microbial Population Genetics 561<br />

Introduction<br />

The studies of microbial population genetics with molecular methods are<br />

often characterised by an extremely high number of samples and by a high<br />

number of molecular markers. As a consequence, an immediate interpretation<br />

of the results can be difficult unless powerful statistical techniques are<br />

used in order to describe the structure of the populations and to highlight the<br />

contributions of its components (Mengoni and Bazzicalupo 2002).<br />

Methods and Procedure<br />

Statistical treatment of data in microbial population genetics include at least<br />

four different levels of analysis:<br />

1. Quantification of the genetic diversity within the population<br />

2. Measurement of genetic distances between strains<br />

3. Analysis of the genetic structure<br />

4. Analysis of the genetic relationships among populations.<br />

Several methods can be used to address each of these points. Here, a brief<br />

summary of the principal parameters and software used is provided.<br />

The molecular data obtained from RAPD,AFLP, and ITS-RFLP analyses are<br />

usually bands in a gel or peaks in a chromatogram. These data are transformed<br />

into a matrix of state binary vectors (molecular haplotype) for each<br />

individual isolate using a compiler such as Microsoft Excel or similar. Bands<br />

and peaks of equal sizes are interpreted as identical and intensity is not considered<br />

as a difference. The molecular haplotype of each isolate is expressed<br />

as a vector of zeroes (for the absence of the band) or ones (for its presence),<br />

assuming that bands represent independent loci.<br />

1. The quantification of genetic diversity within the population can be done<br />

using several parameters. The most commonly used are the gene diversity,<br />

the average gene diversity over loci and the mean number of pairwise differences<br />

between haplotypes.<br />

The gene diversity is equivalent to the expected heterozygosity for diploid<br />

data. It is defined as the probability that two randomly chosen molecular<br />

haplotypes are different in the sample.<br />

The average gene diversity over loci is defined as the probability that two<br />

individuals are different for a randomly chosen locus. These two parameters<br />

vary from 0 (all isolates identical) to 1 (maximum diversity).<br />

The mean number of pair-wise differences simply calculates the mean<br />

number of differences between all pairs of molecular haplotypes in the<br />

population. The computation of these three parameters is performed with<br />

Arlequin software (Schneider et al. 1997).<br />

2. For the measurement of the genetic distances between strains, several<br />

methods can be applied. The basic principle is the ratio of bands shared by

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!