27.10.2014 Views

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Bivariate Analysis: Test<strong>in</strong>g Relations 639<br />

really want to understand what makes the data messy <strong>in</strong> the first place? In<br />

general, you cannot achieve understand<strong>in</strong>g of mess<strong>in</strong>ess by clean<strong>in</strong>g th<strong>in</strong>gs<br />

up. Still, as <strong>in</strong> all aspects of research, be ready to break this rule, too, when<br />

you th<strong>in</strong>k you’ll learn someth<strong>in</strong>g by do<strong>in</strong>g so.<br />

2500<br />

1200<br />

VIOL97<br />

2000<br />

1500<br />

1000<br />

500<br />

Pearson r<br />

Influence<br />

0.10<br />

0.09<br />

0.08<br />

0.07<br />

0.06<br />

0.05<br />

0.04<br />

0.03<br />

0.02<br />

0.01<br />

0.00<br />

VIOL97<br />

1000<br />

800<br />

600<br />

400<br />

200<br />

0<br />

20000 30000 40000 50000<br />

AVGPAY97<br />

a. b.<br />

0<br />

20000 25000 30000 35000 40000<br />

AVGPAY97<br />

Violent crimes <strong>in</strong> the United States (for all<br />

states and Wash<strong>in</strong>gton, D.C.), per 100,000<br />

population, by average annual pay, 1997.<br />

Figure 20.9. The effect of outliers.<br />

Violent crimes <strong>in</strong> the United States (for all<br />

states, without Wash<strong>in</strong>gton, D.C.), per<br />

100,000 population, by average annual pay,<br />

1997.<br />

Figure 20.9a shows the relation between (1) the number of violent crimes<br />

per 100,000 people <strong>in</strong> each of the 50 U.S. states and Wash<strong>in</strong>gton, D.C., <strong>in</strong><br />

1997 and (2) the average annual pay for people <strong>in</strong> the 50 states plus Wash<strong>in</strong>gton,<br />

D.C. Figure 20.9b shows exactly the same th<strong>in</strong>g, but without <strong>in</strong>clud<strong>in</strong>g<br />

the data from Wash<strong>in</strong>gton, D.C. The correlation between the two variables<br />

with D.C. <strong>in</strong> the picture is .44. When we leave out the data for D.C., the correlation<br />

s<strong>in</strong>ks to .19.<br />

That’s because the violent crime rate <strong>in</strong> Wash<strong>in</strong>gton, D.C., was an appall<strong>in</strong>g<br />

2,024 per 100,000 population <strong>in</strong> 1997, and the average annual pay there was<br />

$46,775. For the 50 states <strong>in</strong> the United States, the next highest violent crime<br />

rate, <strong>in</strong> Florida, was half that of D.C., and the next highest average pay, <strong>in</strong><br />

Connecticut, was 17% lower than that <strong>in</strong> D.C.<br />

The sizes of the dots <strong>in</strong> figures 20.9a and b <strong>in</strong>dicate the <strong>in</strong>fluence each data<br />

po<strong>in</strong>t has on the correlation—that is, how much the correlation would change<br />

if you took each data po<strong>in</strong>t out of the calculation. The legend for this little<br />

trick is between the figures. That huge circle <strong>in</strong> the upper-right of figure 20.9a<br />

is Wash<strong>in</strong>gton, D.C. To repeat: Pearson’s r for the two variables <strong>in</strong> figure 20.9a<br />

is 0.442 and for figure 20.9b, without D.C., it’s 0.190. Now, that’s an outlier.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!