Dissertation - HQ
Dissertation - HQ
Dissertation - HQ
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
The statistical analysis of vertical distributions 99<br />
where n ′ is the number of non-zero values of a i (i.e. the number of<br />
nets with catches; nets with no catches have a weight of zero). It is easy<br />
to check that if all a i are equal and non-null, equation (5.4) becomes<br />
equation (5.3). All other descriptors, such as standard deviation or<br />
quantiles, can be computed from weighted variance.<br />
Brodeur & Rugen 210 used the z cm to describe the vertical descrip- Definitions of weighted<br />
tion of ichthyoplankton in Alaska, but their formula for the standard variance differ<br />
deviation erroneously added a square factor to the weights compared<br />
to equation (5.4). While their formula also reduces to (5.3) in the case<br />
of equal weights, it emphasises nets in which abundances are high.<br />
As those are generally closer to the z cm, it diminishes variance. An<br />
alternative equation for the weighted variance, often used in software<br />
packages, is<br />
s 2 w =<br />
P ai(z i − ¯z w) 2<br />
P<br />
ai − 1<br />
(5.5)<br />
which conceptually corresponds to considering weights as “repeats”,<br />
hence the number of observations is the sum of weights (n ′ ↔ P i ai).<br />
When the weights are normalised (i.e. their mean is made equal to one)<br />
equations (5.4) and (5.5) are equivalent.<br />
Confidence of the mean and hypothesis testing<br />
While the z cm is an efficient way to summarise vertical distributions<br />
for further analyses, it can also be used by itself, and two z cms may<br />
be compared. Testing hypothesis on means involves a measure of the<br />
confidence in the value of those means (i.e. the standard deviation of<br />
the mean, also called standard error). However, there is no analytical<br />
equivalent to the standard error for weighted data. Gatz & Smith 211<br />
discuss the validity of several estimates of the weighted standard error<br />
by comparing them to a bootstrap method. The best suited is an<br />
approximate ratio variance by Cochran 212<br />
se 2 w =<br />
n ′<br />
(n ′ − 1)( P a i) 2 hX<br />
(aiz i − ā ¯z w) 2<br />
− 2¯z w<br />
X<br />
(ai − ā)(a iz i − ā¯z w)<br />
(5.6)<br />
No analytical definition<br />
of the weighted<br />
standard error<br />
+¯z 2 w<br />
X<br />
(ai − ā) 2i .<br />
The weighted standard error allows to compute weighted confidence<br />
intervals, perform weighted t-tests, and everything that would normally<br />
be available for non weighted data.<br />
The problem of unequal variances<br />
Dealing with z cms means dealing with non-normal data, hence using<br />
non-parametric tests. It is common belief that the popular Wilcoxon-<br />
Mann-Whitney test, and its multi-sample extension, the Kruskal-Wallis<br />
test, do not require the variances to be equal, because they are nonparametric.<br />
This is wrong 213,214 . As, under the null hypothesis, the two<br />
Non-parametric tests<br />
require homogeneity<br />
of variances