24.10.2014 Views

Dissertation - HQ

Dissertation - HQ

Dissertation - HQ

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

The statistical analysis of vertical distributions 99<br />

where n ′ is the number of non-zero values of a i (i.e. the number of<br />

nets with catches; nets with no catches have a weight of zero). It is easy<br />

to check that if all a i are equal and non-null, equation (5.4) becomes<br />

equation (5.3). All other descriptors, such as standard deviation or<br />

quantiles, can be computed from weighted variance.<br />

Brodeur & Rugen 210 used the z cm to describe the vertical descrip- Definitions of weighted<br />

tion of ichthyoplankton in Alaska, but their formula for the standard variance differ<br />

deviation erroneously added a square factor to the weights compared<br />

to equation (5.4). While their formula also reduces to (5.3) in the case<br />

of equal weights, it emphasises nets in which abundances are high.<br />

As those are generally closer to the z cm, it diminishes variance. An<br />

alternative equation for the weighted variance, often used in software<br />

packages, is<br />

s 2 w =<br />

P ai(z i − ¯z w) 2<br />

P<br />

ai − 1<br />

(5.5)<br />

which conceptually corresponds to considering weights as “repeats”,<br />

hence the number of observations is the sum of weights (n ′ ↔ P i ai).<br />

When the weights are normalised (i.e. their mean is made equal to one)<br />

equations (5.4) and (5.5) are equivalent.<br />

Confidence of the mean and hypothesis testing<br />

While the z cm is an efficient way to summarise vertical distributions<br />

for further analyses, it can also be used by itself, and two z cms may<br />

be compared. Testing hypothesis on means involves a measure of the<br />

confidence in the value of those means (i.e. the standard deviation of<br />

the mean, also called standard error). However, there is no analytical<br />

equivalent to the standard error for weighted data. Gatz & Smith 211<br />

discuss the validity of several estimates of the weighted standard error<br />

by comparing them to a bootstrap method. The best suited is an<br />

approximate ratio variance by Cochran 212<br />

se 2 w =<br />

n ′<br />

(n ′ − 1)( P a i) 2 hX<br />

(aiz i − ā ¯z w) 2<br />

− 2¯z w<br />

X<br />

(ai − ā)(a iz i − ā¯z w)<br />

(5.6)<br />

No analytical definition<br />

of the weighted<br />

standard error<br />

+¯z 2 w<br />

X<br />

(ai − ā) 2i .<br />

The weighted standard error allows to compute weighted confidence<br />

intervals, perform weighted t-tests, and everything that would normally<br />

be available for non weighted data.<br />

The problem of unequal variances<br />

Dealing with z cms means dealing with non-normal data, hence using<br />

non-parametric tests. It is common belief that the popular Wilcoxon-<br />

Mann-Whitney test, and its multi-sample extension, the Kruskal-Wallis<br />

test, do not require the variances to be equal, because they are nonparametric.<br />

This is wrong 213,214 . As, under the null hypothesis, the two<br />

Non-parametric tests<br />

require homogeneity<br />

of variances

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!