27.10.2014 Views

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

624 Chapter 20<br />

TABLE 20.14<br />

Infant Mortality by TFR for 10 Countries from Table 19.8<br />

INFMORT<br />

TFR<br />

State x y<br />

Armenia 26 1.70<br />

Chad 112 6.07<br />

El Salvador 32 3.17<br />

Ghana 66 5.15<br />

Iran 35 2.80<br />

Latvia 18 1.25<br />

Namibia 65 4.90<br />

Panama 21 2.63<br />

Slovenia 7 1.26<br />

Sur<strong>in</strong>ame 29 2.21<br />

Mean of x 41.10 Mean of y 3.114<br />

countries. Your best guess—your lowest prediction error—would be the mean,<br />

3.114 children per woman. You can see this <strong>in</strong> figure 20.4 where I’ve plotted<br />

the distribution of TFR and INFMORT for the 10 countries <strong>in</strong> table 20.14.<br />

The Sums of the Squared Distances to the Mean<br />

Each dot <strong>in</strong> figure 20.4 is physically distant from the dotted mean l<strong>in</strong>e by a<br />

certa<strong>in</strong> amount. The sum of the squares of these distances to the mean l<strong>in</strong>e is<br />

the smallest sum possible (that is, the smallest cumulative prediction error you<br />

could make), given that you only know the mean of the dependent variable.<br />

The distances from the dots above the l<strong>in</strong>e to the mean are positive; the distances<br />

from the dots below the l<strong>in</strong>e to the mean are negative. The sum of the<br />

actual distances is zero. Squar<strong>in</strong>g the distances gets rid of the negative numbers.<br />

But suppose you do know the data <strong>in</strong> table 20.14 regard<strong>in</strong>g the <strong>in</strong>fant mortality<br />

rate for each of those 10 countries. Can you reduce the prediction error<br />

<strong>in</strong> guess<strong>in</strong>g the TFR for those countries? Could you draw another l<strong>in</strong>e through<br />

figure 20.4 that ‘‘fits’’ the dots better and reduces the sum of the distances<br />

from the dots to the l<strong>in</strong>e?<br />

You bet you can. The solid l<strong>in</strong>e that runs diagonally through the graph <strong>in</strong><br />

figure 20.4 m<strong>in</strong>imizes the prediction error for these data. This l<strong>in</strong>e is called<br />

the best fitt<strong>in</strong>g l<strong>in</strong>e, or the least squares l<strong>in</strong>e, or the regression l<strong>in</strong>e. When<br />

you understand how this regression l<strong>in</strong>e is derived, you’ll understand how correlation<br />

works.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!