09.12.2012 Views

Principles of Plant Genetics and Breeding

Principles of Plant Genetics and Breeding

Principles of Plant Genetics and Breeding

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Coefficient <strong>of</strong> variation<br />

The coefficient <strong>of</strong> variation is a measure <strong>of</strong> the relative<br />

variability <strong>of</strong> given populations. Variance estimates have<br />

units attached to them. Consequently, it is not possible<br />

to compare population measurements <strong>of</strong> different units<br />

(i.e., comparing apples with oranges). For example, one<br />

population may be measured in kilograms (e.g., yield),<br />

while another is measured in centimeters or feet (e.g.,<br />

plant height). A common application <strong>of</strong> variance is the<br />

test to find out if one biological sample is more variable<br />

for one trait than for another (e.g., is plant height in soybean<br />

more variable than the number <strong>of</strong> pods per plant?).<br />

Larger organisms usually vary more than smaller ones.<br />

Similarly, traits with larger means tend to vary more<br />

than those with smaller means. For example, grain yield<br />

per hectare <strong>of</strong> a cultivar (in kg/ha) will vary more than<br />

its 100 seed weight (in grams). For these <strong>and</strong> other<br />

enquiries, the coefficient <strong>of</strong> variation facilitates the comparison<br />

because it is unit free.<br />

The coefficient <strong>of</strong> variation (CV) is calculated as:<br />

CV = (s/X ¯ ) × 100<br />

For the number <strong>of</strong> leaves per plant example:<br />

CV = 1.37/7.9 = 0.173<br />

= 17.3%<br />

A CV <strong>of</strong> 10% or less is generally desirable in biological<br />

experiments.<br />

St<strong>and</strong>ard error <strong>of</strong> the mean<br />

The st<strong>and</strong>ard error measures the amount <strong>of</strong> variability<br />

among individual units in a population. If several samples<br />

are taken from one population, the individuals will<br />

vary within samples as well as among samples. The st<strong>and</strong>ard<br />

error <strong>of</strong> the mean (SE) measures the variability<br />

among different sample means taken from a population.<br />

It is computed as:<br />

s x = s/√n<br />

For the number <strong>of</strong> leaves per plant example:<br />

s x = 1.37/√10<br />

= 0.433<br />

The st<strong>and</strong>ard error <strong>of</strong> the mean indicates how precisely<br />

the population parameter has been estimated. It<br />

COMMON STATISTICAL METHODS IN PLANT BREEDING 151<br />

may be attached to the mean in the presentation <strong>of</strong><br />

results in a publication (e.g., for the leaves per plant<br />

example, it will be 7.9 ± 0.43).<br />

Simple linear correlation<br />

<strong>Plant</strong> breeders are not only interested in variability as<br />

regards a single characteristic <strong>of</strong> a population, but <strong>of</strong>ten<br />

they are interested in how multiple characteristics <strong>of</strong><br />

the units <strong>of</strong> a population associate. If there is no association,<br />

covariance will be zero or close to zero. The<br />

magnitude <strong>of</strong> covariance is <strong>of</strong>ten related to the size <strong>of</strong><br />

the variables themselves, <strong>and</strong> also depends on the scale<br />

<strong>of</strong> measurement.<br />

The simple linear correlation measures the linear<br />

relationship between two variables. It measures a joint<br />

property <strong>of</strong> two variables. <strong>Plant</strong> breeding is facilitated<br />

when desirable genes are strongly associated on the<br />

chromosome. The relationship <strong>of</strong> interest in correlation<br />

is not based on cause <strong>and</strong> effect. The degree (closeness<br />

or strength) <strong>of</strong> linear association between variables is<br />

measured by the correlation coefficient. The correlation<br />

coefficient is free <strong>of</strong> scale <strong>and</strong> measurement, <strong>and</strong> has<br />

values that lie between +1 <strong>and</strong> −1 (i.e., correlation can<br />

be positive or negative). If there is no linear association<br />

between variables, the correlation is zero. However, a<br />

lack <strong>of</strong> significant linear correlation does not mean there<br />

is no association (the association could be non-linear or<br />

curvilinear).<br />

The population correlation coefficient (ρ) is given by:<br />

ρ=σ 2 XY /√(σ X 2 ×σ Y 2 )<br />

where σ X 2 = variance <strong>of</strong> X, σY 2 = variance <strong>of</strong> Y, <strong>and</strong> σ 2 XY =<br />

covariance <strong>of</strong> X <strong>and</strong> Y. The sample covariance is<br />

called the Pearson correlation coefficient (r) <strong>and</strong> is<br />

calculated as:<br />

r = s 2 XY /√(s 2 X × s 2 Y )<br />

where s 2 XY = sample covariance <strong>of</strong> X <strong>and</strong> Y, s 2 X = sample<br />

variance <strong>of</strong> X, <strong>and</strong> s 2 Y = sample variance <strong>of</strong> Y.<br />

The computational formula is:<br />

r = [N(∑XY ) − (∑X)(∑Y )]/<br />

√[N(∑X 2 ) − (∑X) 2 ][N(∑Y 2 ) − (∑Y ) 2 ]<br />

The data in Table 9.3 shows the seed oil <strong>and</strong> protein<br />

content <strong>of</strong> 10 soybean cultivars. The calculation yields<br />

the following results:

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!