25.10.2012 Views

Laurie Bauer - WordPress.com — Get a Free Blog Here

Laurie Bauer - WordPress.com — Get a Free Blog Here

Laurie Bauer - WordPress.com — Get a Free Blog Here

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

THE LINGUISTICS STUDENT’S HANDBOOK 168<br />

to the same population, that each sample you have collected is a different<br />

random sample from the same population. So the null hypothesis or the<br />

sceptic’s position is that your two samples, though not identical, are not<br />

different enough to be samples of different populations.<br />

Statisticians have various tests available to them for deciding whether the<br />

null hypothesis is or is not correct, and which test is appropriate depends<br />

upon factors such as the kind of data you are using (e.g. whether it is measurements<br />

on a physical scale or simple categorical answers to individual<br />

questions – yes versus no, occupational types, and so on), and if you have a<br />

physical scale whether the scale is extendable in both directions or just one.<br />

As you read you will start to be<strong>com</strong>e familiar with the names of some of these<br />

tests, such as Student’s t (one- or two-tailed t-tests), chi-squared (� 2 ),<br />

Wilcoxon, etc. If you want to be well informed, you will check out when each<br />

should be used. But if you assume that each has been appropriately used, you<br />

will eventually <strong>com</strong>e to a p-value, which is a statement of the probability that<br />

your samples could have <strong>com</strong>e from a population for which the null hypothesis<br />

is true. A value of p will be given which will be between zero and one.<br />

Large values close to p = 1 support the null hypothesis, low values close to p<br />

= 0 indicate that the data set is inconsistent with the null hypothesis. This is<br />

probably what you were hoping for when you asked the question. In practice<br />

we never see p-values of exactly zero or one, but an answer near to zero is good<br />

enough to conclude that there is a real difference between the two sets of<br />

samples you have taken. But how close does it have to be to zero before you<br />

can claim success?<br />

If the probability is greater than 0.05, statisticians agree that it is not significant.<br />

This is a technical usage of the word significant (though one based<br />

on the ordinary language term). For any value greater than that, you cannot<br />

assume that the null hypothesis is in<strong>com</strong>patible with the data from your<br />

samples. Because significant is used in this very precise way in statistics, if you<br />

do not mean it in its statistical sense you should avoid it in favour of some<br />

synonym (important, indicative, suggestive, telling) when discussing results to<br />

which you might in principle have applied a statistical test.<br />

What does it mean if we say that p = 0.05, or, more likely p < 0.05 (‘p is less<br />

than 0.05’)? What it means is that you have seen quite a large or quite a consistent<br />

difference in the properties of the samples (e.g. the sample means), and<br />

that there is only one chance in twenty that the two samples you took could be<br />

samples from the same population and still look that different. If you think of<br />

this in terms of betting money, those odds seem pretty good. If you put up £1,<br />

and nineteen times out of twenty you win £1, and the other time you lose your<br />

£1, you will be able to afford your own beer all evening. But if you think of it<br />

in terms of something more serious, you would not like those odds. You would<br />

not drive on a motorway if the odds of having a serious accident there were one

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!