29.12.2013 Views

Base SAS 9.1.3 Procedures Guide - Acsu Buffalo

Base SAS 9.1.3 Procedures Guide - Acsu Buffalo

Base SAS 9.1.3 Procedures Guide - Acsu Buffalo

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

280 Chapter 3. The UNIVARIATE Procedure<br />

For a symmetric distribution, the symmetrically trimmed mean is an unbiased estimate<br />

of the population mean. However, the trimmed mean does not have a normal<br />

distribution even if the data are from a normal population.<br />

A robust estimate of the variance of the trimmed mean t tk can be based on<br />

the Winsorized sum of squared deviations s 2 wk<br />

, which is defined in the section<br />

“Winsorized Means” on page 278; refer to Tukey and McLaughlin (1963). This can<br />

be used to compute a trimmed t test which is based on the test statistic<br />

t tk = (¯x tk − µ 0 )<br />

SE(¯x tk )<br />

where the standard error of the trimmed mean is<br />

SE(¯x tk ) =<br />

s wk<br />

√<br />

(n − 2k)(n − 2k − 1)<br />

When the data are from a symmetric distribution, the distribution of t tk is approximated<br />

by a Student’s t distribution with n − 2k − 1 degrees of freedom (Tukey and<br />

McLaughlin 1963; Dixon and Tukey 1968).<br />

The “trimmed” 100(1−α)% confidence interval for the location parameter has upper<br />

and lower limits<br />

¯x tk ± t 1−<br />

α<br />

2 ;n−2k−1SE(¯x tk)<br />

where t α 1−<br />

2 ;n−2k−1 is the (1 − α 2<br />

)100th percentile of the Student’s t distribution with<br />

n − 2k − 1 degrees of freedom.<br />

Robust Estimates of Scale<br />

The sample standard deviation, which is the most commonly used estimator of scale,<br />

is sensitive to outliers. Robust scale estimators, on the other hand, remain bounded<br />

when a single data value is replaced by an arbitrarily large or small value. The<br />

UNIVARIATE procedure computes several robust measures of scale, including the<br />

interquartile range, Gini’s mean difference G, the median absolute deviation about<br />

the median (MAD), Q n , and S n . In addition, the procedure computes estimates of<br />

the normal standard deviation σ derived from each of these measures.<br />

The interquartile range (IQR) is simply the difference between the upper and lower<br />

quartiles. For a normal population, σ can be estimated as IQR/1.34898.<br />

Gini’s mean difference is computed as<br />

G = ( 1 ) ∑ |x i − x j |<br />

n<br />

i

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!