02.03.2013 Views

Thinking and Deciding

Thinking and Deciding

Thinking and Deciding

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

184 JUDGMENT OF CORRELATION AND CONTINGENCY<br />

Table 8.1: Height <strong>and</strong> weight for five children<br />

Child Height (in.) Weight (lb.)<br />

A 48 45<br />

B 43 35<br />

C 51 55<br />

D 46 44<br />

E 45 40<br />

correlated with weight among people, we mean that taller people tend to weigh<br />

more than shorter people. As one figure goes up, so does the other, <strong>and</strong> vice versa.<br />

We can measure the association between two different quantities on a scale of 1<br />

to −1 by a correlation coefficient, abbreviated r. (Theformulaforr is found in most<br />

statistics texts.) The coefficient r measures the extent to which one variable can be<br />

predicted as a linear function of the other. A correlation coefficient of 1 indicates<br />

that one measure is a perfect linear function of another, or has a perfect positive<br />

correlation with it. An example is the relationship between degrees Fahrenheit (F)<br />

<strong>and</strong> degrees Celsius (C): If we know one of these, we can calculate the other exactly<br />

by a simple linear formula, F =1.8 · C +32. A correlation of 0 indicates no linear<br />

relationship at all. In a long sequence of dice rolls, for example, there ought to be<br />

no correlation between the number thrown on one roll of the dice <strong>and</strong> the number<br />

thrown on the next roll. A correlation of −1 indicates a perfect negative correlation;<br />

for example, there is a perfect negative correlation between a person’s height <strong>and</strong><br />

the distance of the person’s head from the ceiling of the room I am now in. The<br />

correlation coefficient for the height column <strong>and</strong> the weight column in Table 14.1 is<br />

0.98, very close to 1. (Again, consult a statistics book for the formula for r.)<br />

The correlation coefficient is only one among many reasonable measures of correlation.<br />

It has certain convenient mathematical properties, <strong>and</strong> it can be shown to be<br />

the best measure for achieving certain goals. These issues are discussed in statistics<br />

books. For our purposes, the most important property of the correlation coefficient<br />

is that it treats all observations equally. For example, it does not weigh more heavily<br />

those pairs of numbers that are consistent with the existence of a positive correlation,<br />

as opposed to a negative or a 0 correlation.<br />

We can investigate correlations, or associations, when each variable concerns<br />

membership in a category. (Such variables are called dichotomous because membership<br />

<strong>and</strong> nonmembership constitute a dichotomy. Another term is “binary.”) We<br />

may ask, for example, whether “presence of blue eyes” (membership in the category<br />

of blue-eyed people) is correlated with “presence of blond hair” in a population of<br />

people. To calculate the correlation, we can assign a 1 to “presence” of each variable<br />

(blue eyes or blond hair) <strong>and</strong> a 0 to “absence” of each variable for each person

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!