13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

44 CONTINGENCY TABLES<br />

when the data are very unbalanced, such as when some categories have many more<br />

observations than other categories. Table 2.7 illustrates this. For the equally spaced<br />

row scores (1, 2, 3, 4, 5), M 2 = 1.83, giving a much weaker conclusion (P = 0.18).<br />

The magnitudes of r and M 2 do not change with transformations of the scores that<br />

maintain the same relative spacings between the categories. For example, scores (1, 2,<br />

3, 4, 5) yield the same correlation as scores (0, 1, 2, 3, 4) or (2, 4, 6, 8, 10) or (10, 20,<br />

30, 40, 50).<br />

An alternative approach assigns ranks <strong>to</strong> the subjects and uses them as the category<br />

scores. For all subjects in a category, one assigns the average of the ranks that would<br />

apply for a complete ranking of the sample from 1 <strong>to</strong> n. These are called midranks.<br />

For example, in Table 2.7 the 17,114 subjects at level 0 for alcohol consumption<br />

share ranks 1 through 17,114. We assign <strong>to</strong> each of them the average of these ranks,<br />

which is the midrank (1 + 17,114)/2 = 8557.5. The 14,502 subjects at level

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!