09.12.2012 Views

Principles of Plant Genetics and Breeding

Principles of Plant Genetics and Breeding

Principles of Plant Genetics and Breeding

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

averages, to the more complex multivariate analysis.<br />

Computers are required for complex analyses, but<br />

sometimes, the breeder may have a small amount <strong>of</strong><br />

data <strong>and</strong> might want to use a h<strong>and</strong>held calculator for<br />

quick results. Hence, there is the need to know the<br />

computational basis <strong>of</strong> the commonly used statistical<br />

methods.<br />

Population versus sample<br />

A statistical population is the totality <strong>of</strong> the units (individuals)<br />

<strong>of</strong> interest to the researcher. It follows then<br />

that, depending on the researcher’s objectives, a population<br />

may be small or infinitely large. A small population<br />

can be measured in its entirety. <strong>Plant</strong> breeders <strong>of</strong>ten<br />

h<strong>and</strong>le large populations, <strong>and</strong> obtaining measurement<br />

from the entire population is <strong>of</strong>ten impractical. Instead,<br />

researchers obtain measurements from a subset <strong>of</strong><br />

the population, called a sample. The scores from the<br />

sample are used to infer or estimate the scores we would<br />

expect to find if it were possible to measure the entire<br />

population.<br />

In order to draw accurate conclusions about the<br />

population, the sample must be representative <strong>of</strong><br />

the population. To obtain a representative sample, the<br />

statistical technique <strong>of</strong> r<strong>and</strong>om sampling (in which all<br />

possible scores in the population have an equal chance<br />

<strong>of</strong> being selected for a sample) is used. There are other<br />

methods <strong>of</strong> drawing samples from a population for a<br />

variety <strong>of</strong> purposes. These include quota, convenience,<br />

<strong>and</strong> stratified sampling methods. A number that<br />

describes a characteristic <strong>of</strong> a population is called a<br />

sample statistic or simply, a statistic. A number that<br />

describes a population is called a population parameter,<br />

or simply, a parameter.<br />

Issue <strong>of</strong> causality<br />

Scientific conclusions are drawn from the preponderance<br />

<strong>of</strong> the evidence obtained from properly conducted<br />

research. Cause <strong>and</strong> effect is implicit in the logic <strong>of</strong><br />

researchers. However, it is difficult to definitely prove<br />

that variable X causes variable Y. There is always the<br />

possibility that some unknown variable is actually<br />

responsible for the effect observed (change in scores).<br />

No statistical procedure will prove that one variable<br />

causes another variable to change (i.e., statistics does<br />

not prove anything!). An experiment provides evidence<br />

to argue for a certain point <strong>of</strong> view, not prove it.<br />

COMMON STATISTICAL METHODS IN PLANT BREEDING 147<br />

Statistical hypothesis<br />

A hypothesis is an informed conjecture (educated<br />

guess) about a phenomenon. It is arrived at after taking<br />

into account pertinent scientific knowledge <strong>and</strong> personal<br />

experience. Researchers <strong>of</strong>ten have preconceived<br />

ideas about the phenomenon that they seek to investigate.<br />

However, they should be willing to approach an<br />

investigation with an open mind. A hypothesis declares<br />

the prediction <strong>of</strong> the researcher concerning the relationship<br />

between two or more variables associated with the<br />

study. An experiment is designed to test this relationship.<br />

In plant breeding, a breeder ends up with about a<br />

dozen promising genotypes from which one would<br />

eventually be selected for release to farmers for use in<br />

cultivation. The breeder conducts field tests or trials<br />

(over locations <strong>and</strong> years) to help in the decision-making<br />

process. He or she suspects or predicts differences<br />

among these genotypes. The predicted difference represents<br />

a true phenomenon. To avoid any biases, the<br />

hypothesis is formulated in the opposite direction to the<br />

predicted or suspected outcome. That is, the researcher<br />

would state that no real differences exist among the<br />

genotypes (i.e., any differences are due to chance).<br />

This is the null hypothesis (H 0 ) or the hypothesis <strong>of</strong><br />

no difference. The alternative hypothesis (H 1 ) would<br />

indicate a real difference exists. There is a st<strong>and</strong>ard way<br />

<strong>of</strong> mathematically stating a hypothesis. If four genotypes<br />

were being evaluated, a hypothesis could be formulated<br />

as follows:<br />

Null hypothesis, H 0 : µ 1 =µ 2 =µ 3 =µ 4<br />

(i.e., all genotype means are equal)<br />

Alternate hypothesis, H 1 : µ 1 ≠µ 2 ≠µ 3 ≠µ 4<br />

(i.e., all genotype means are not equal)<br />

The H 0 is accepted (i.e., automatically reject H 1 ), or<br />

rejected (accept H 0 ), at a chosen level <strong>of</strong> statistical<br />

significance, α (e.g., α =0.01 or 0.05; acknowledging<br />

that 1% or 5% <strong>of</strong> the time you could be mistaken in<br />

your conclusion). In other words, the research does not<br />

prove anything outright, as previously pointed out.<br />

Rejecting H 0 when it is true (i.e., you are saying a difference<br />

exists when in fact none really does) is called a<br />

Type I error. On the other h<strong>and</strong>, failure to find a true<br />

difference when it exists (accepting a false H 0 ) is called a<br />

Type II error.<br />

The goal <strong>of</strong> a plant breeder is to conduct research in<br />

such a way that true differences, when they exist, are<br />

observed. This depends on the adoption <strong>of</strong> sound<br />

experimental procedures, <strong>of</strong>ten called field plot techniques,<br />

<strong>and</strong> is discussed later in this chapter.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!