27.10.2014 Views

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

Russel-Research-Method-in-Anthropology

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Sampl<strong>in</strong>g Theory 181<br />

Look up what’s called the critical value of t <strong>in</strong> appendix C. In the column<br />

for .025, we see that the value is 2.571 with 5 degrees of freedom. Degrees of<br />

freedom are one less than the size of the sample, so for a sample of six we<br />

need a t statistic of 2.571 to atta<strong>in</strong> 95% confidence. (The concept of degrees<br />

of freedom is described further <strong>in</strong> chapter 19 <strong>in</strong> the section on t-tests. And<br />

keep <strong>in</strong> m<strong>in</strong>d that we’re <strong>in</strong>terested <strong>in</strong> the modulus, or absolute value of t. A<br />

value of –2.571 is just as statistically significant as a value of 2.571.)<br />

So, with small samples—which, for practical purposes, means less than 30<br />

units of analysis—we use appendix C (for t) <strong>in</strong>stead of appendix B (for z) to<br />

determ<strong>in</strong>e the confidence limits around the mean of our estimate. You can see<br />

from appendix C that for large samples—30 or more—the difference between<br />

the t and the z statistics is negligible. (The t-value of 2.042 for 30 degrees of<br />

freedom—which means a sample of 31—is very close to 1.96.)<br />

The Catch<br />

Suppose that <strong>in</strong>stead of estimat<strong>in</strong>g the <strong>in</strong>come of a population with a sample<br />

of 100, we use a sample of 10 and get the same result—RM12,600 and a standard<br />

deviation of RM4,000. For a sample this size, we use the t distribution.<br />

With 9 degrees of freedom and an alpha value of .025, we have a t value of<br />

2.262. For a normal curve, 95% of all scores fall with<strong>in</strong> 1.96 standard errors<br />

of the mean. The correspond<strong>in</strong>g t value is 2.262 standard errors. Substitut<strong>in</strong>g<br />

<strong>in</strong> the formula, we get:<br />

RM12,600 2.262 RM4,000 RM9,739 to RM15,461<br />

10<br />

But there’s a catch. With a large sample (greater than 30), we know from the<br />

central limit theorem that the sampl<strong>in</strong>g distribution will be normal even if the<br />

population isn’t. Us<strong>in</strong>g the t distribution with a small sample, we can calculate<br />

the confidence <strong>in</strong>terval around the mean of our sample only under the assumption<br />

that the population is normally distributed.<br />

In fact, look<strong>in</strong>g back at figure 7.5, we know that the distribution of real data<br />

is not perfectly normal. It is somewhat skewed (more about skewed distributions<br />

<strong>in</strong> chapter 19). In real research, we’d never take a sample from a set of<br />

just 50 data po<strong>in</strong>ts—we’d do all our calculations on the full set of the actual<br />

data. When we take samples, it’s because we don’t know what the distribution<br />

of the data looks like. And that’s why sample size counts.<br />

Calculat<strong>in</strong>g Sample Size for Estimat<strong>in</strong>g Proportions<br />

And now for proportions. What we’ve learned so far about estimat<strong>in</strong>g the<br />

mean of cont<strong>in</strong>uous variables (like <strong>in</strong>come and percentages) is applicable to<br />

the estimation of proportions as well.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!