27.10.2014 Views

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Multivariate Analysis 663<br />

which means that we need to f<strong>in</strong>d a separate constant—one called b 1 and one<br />

called b 2 —by which to multiply each of the two <strong>in</strong>dependent variables. The<br />

general formula for multiple regression is:<br />

y a b 1 x 1 b 2 x 2 ... b n x n Formula 21.4<br />

Recall that simple regression yields a PRE measure, r 2 . It tells you how much<br />

better you can predict a series of measures of a dependent variable than you<br />

could by just guess<strong>in</strong>g the mean for every measurement. Multiple regression<br />

is also a PRE measure. It, too, tells you how much better you can predict measures<br />

of a dependent variable than you could if you guessed the mean—but<br />

us<strong>in</strong>g all the <strong>in</strong>formation available <strong>in</strong> a series of <strong>in</strong>dependent variables.<br />

The key to regression are those b coefficients, orweights, <strong>in</strong> Formula 21.4.<br />

We want coefficients that, when multiplied by the <strong>in</strong>dependent variables, produce<br />

the best possible prediction of the dependent variable—that is, we want<br />

predictions that result <strong>in</strong> the smallest possible residuals. Those coefficients,<br />

by the way, are not existential constants. Remember that samples yield statistics,<br />

which only estimate parameters. Your statistics change with every sample<br />

you take and with the number of <strong>in</strong>dependent variables <strong>in</strong> the equation.<br />

The MVD-TEENBIRTH Puzzle<br />

Let’s try to solve the puzzle of the relation between teenage births and the<br />

rate of motor vehicle deaths. We can take a stab at this us<strong>in</strong>g multiple regression<br />

by try<strong>in</strong>g to predict the rate of teenage births without the data from motor<br />

vehicle deaths.<br />

From our previous analyses with elaboration tables and with partial correlation,<br />

we already had an idea that <strong>in</strong>come might have someth<strong>in</strong>g to do with the<br />

rate of teen births. We know from the literature (Handwerker 1998) that poverty<br />

is associated with violence and with teenage pregnancy, and that this is<br />

true across ethnic groups, so I’ve added a variable on violent crimes that I<br />

th<strong>in</strong>k might be a proxy for the amount of violence aga<strong>in</strong>st persons <strong>in</strong> each of<br />

the states.<br />

Table 21.16 shows the correlation matrix for four variables: TEENBIRTH<br />

(the percentage of births to teenagers <strong>in</strong> the 50 U.S. states dur<strong>in</strong>g 1996);<br />

INCOME (the mean per capita <strong>in</strong>come for each of the 50 states dur<strong>in</strong>g 1996);<br />

VIOLRATE (the rate of violent crime—rape, murder, assault, and robbery—<br />

per 100,000 population <strong>in</strong> the 50 states <strong>in</strong> 1995); and MVD (the number of<br />

motor vehicle deaths per 100 million vehicle miles <strong>in</strong> each of the 50 states <strong>in</strong><br />

1995).<br />

We see right away that the mean per capita <strong>in</strong>come predicts the rate of births<br />

to teenagers (r .700) almost as well as does the rate of motor vehicle

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!