01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

5.3 Inference on Two Populations 203<br />

In order to assess these hypotheses, the Mann-Whitney test starts by assigning<br />

ranks to the samples. Let the samples be denoted x1, x2, …, xn <strong>and</strong> y1, y2, …, ym.<br />

The ranking of the xi <strong>and</strong> yi assigns ranks in 1, 2, …, n + m. As an example, let us<br />

consider the following situation:<br />

xi : 12 21 15 8<br />

yi : 9 13 19<br />

The ranking of xi <strong>and</strong> yi would then yield the result:<br />

Variable: X Y X Y X Y X<br />

Data: 8 9 12 13 15 19 21<br />

Rank: 1 2 3 4 5 6 7<br />

The test statistic is the sum of the ranks for one of the variables, say X:<br />

W<br />

X<br />

=<br />

n<br />

∑ i=<br />

1<br />

R(<br />

x ) , 5.31<br />

i<br />

where R(xi) are the ranks assigned to the xi. For the example above, WX = 16.<br />

Similarly, WY = 12 with:<br />

N(<br />

N + 1)<br />

W X + WY<br />

= , total sum of the ranks from 1 through N = n + m.<br />

2<br />

The rationale for using WX as a test statistic is that under the null hypothesis,<br />

P(X > Y ) = ½, one expects the ranks to be r<strong>and</strong>omly distributed between the xi <strong>and</strong><br />

yi, therefore resulting in approximately equal average ranks in each of the two<br />

samples. For small samples, there are tables with the exact probabilities of WX. For<br />

large samples (say m or n above 10), the sampling distribution of WX rapidly<br />

approaches the normal distribution with the following parameters:<br />

n(<br />

N + 1)<br />

2 nm(<br />

N + 1)<br />

µ W = ; σ W = . 5.32<br />

X<br />

X<br />

2<br />

12<br />

Therefore, for large samples, the following test statistic with st<strong>and</strong>ard normal<br />

distribution is used:<br />

z<br />

*<br />

=<br />

WX<br />

± 0.<br />

5 − µ<br />

σ<br />

WX<br />

WX<br />

. 5.33<br />

The 0.5 continuity correction factor is added when one wants to determine<br />

critical points in the left tail of the distribution, <strong>and</strong> subtracted to determine critical<br />

points in the right tail of the distribution.<br />

When compared with its parametric counterpart, the t test, the Mann-Whitney<br />

test has a high power-efficiency, of about 95.5%, for moderate to large n. In some

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!