25.10.2012 Views

Laurie Bauer - WordPress.com — Get a Free Blog Here

Laurie Bauer - WordPress.com — Get a Free Blog Here

Laurie Bauer - WordPress.com — Get a Free Blog Here

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

165 STATISTICS<br />

means giving a known, though not necessarily equal, chance of selection to<br />

every token in the population. In the simplest case (called simple random sampling)<br />

it means that the chance of selection of any token of our vowel in the<br />

sample is not in any way connected to the selection of any other token in the<br />

sample. In linguistics, what it means to have a random sample is frequently<br />

fudged. Does it mean that no two tokens are produced by the same speaker? In<br />

the simplest case it does, but this requirement is rarely met in linguistic sampling.<br />

A thousand tokens from a single speaker clearly tell us less about the population<br />

than 10 tokens from each of 100 speakers. And if we have 10 tokens<br />

from 100 speakers who live in the same street then we’ll learn less than if those<br />

100 speakers <strong>com</strong>e from all over the city. There are statistical methods, some<br />

quite <strong>com</strong>plex, for taking such associations between sample members into<br />

account when making inferences. However, all the linguist needs to keep in<br />

mind is that although in general we learn more from larger samples, if the individual<br />

responses are linked to each other in any way at all precision is lost.<br />

So far, the assumption has been made that you just want to know something<br />

about your sample as a homogeneous set: what the quality of a vowel is in<br />

Birmingham, for example. But you may believe that speakers in one suburb<br />

speak differently from speakers in another suburb, or that people over 190 cm<br />

in height speak differently from speakers under 160 cm in height. In such cases<br />

you are in effect dividing your survey population into a number of subpopulations,<br />

and then <strong>com</strong>paring the subpopulations. If the subpopulations are large<br />

you may be confident of finding enough people in a random sample (e.g. a<br />

random sample of students in most subjects ought to net a large number of both<br />

males and females, even if the proportions are not 50:50). However, sampling<br />

for minority groups, such as people above 190 cm tall, requires a purpose-built<br />

sampling scheme to make sure you get enough sample members from that subpopulation.<br />

You might need, for example, to visit basketball clubs to find<br />

people over 190 cm tall. Such an approach may <strong>com</strong>plicate the analysis somewhat,<br />

and statistical advice should be sought at the planning stage if you need<br />

to design a <strong>com</strong>plex sampling scheme. How you create a random sample of the<br />

relevant population is an important matter, as is how you ensure a response<br />

from your random sample.<br />

Presenting the sample<br />

Having taken a sample and derived such information as you require from that<br />

sample, you are ready to describe the behaviour of that sample. Where your<br />

sample is one of actual people, you will need to get those people to produce a<br />

certain piece of information for you in a recordable way. Where your sample is<br />

a number of words or sentences, you will need to analyse them in the way relevant<br />

to your experiment (e.g. count how long the words are in letters and/or

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!