14.03.2014 Views

Basic Analysis and Graphing - SAS

Basic Analysis and Graphing - SAS

Basic Analysis and Graphing - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 2 Performing Univariate <strong>Analysis</strong> 79<br />

Statistical Details for the Distribution Platform<br />

Normal Mixtures<br />

The Normal Mixtures option fits a mixture of normal distributions. This flexible distribution is capable of<br />

fitting multi-modal data.<br />

Fit a mixture of two or three normal distributions by selecting the Normal 2 Mixture or Normal 3 Mixture<br />

options. Alternatively, you can fit a mixture of k normal distributions by selecting the Other option. A<br />

separate mean, st<strong>and</strong>ard deviation, <strong>and</strong> proportion of the whole is estimated for each group.<br />

pdf:<br />

k π i<br />

σ ---- φ x – μ i <br />

------------<br />

<br />

i σ<br />

i=<br />

1 i <br />

k<br />

E(x) = π i<br />

μ i<br />

i=<br />

1<br />

k<br />

k 2<br />

Var(x) = π i<br />

μ2 i<br />

σ2<br />

( + i<br />

) – <br />

π i<br />

μ i <br />

i = 1<br />

i=<br />

1 <br />

where μ i , σ i , <strong>and</strong> π i are the respective mean, st<strong>and</strong>ard deviation, <strong>and</strong> proportion for the i th group, <strong>and</strong><br />

φ ( . )<br />

is the st<strong>and</strong>ard normal pdf.<br />

Smooth Curve<br />

The Smooth Curve option fits a smooth curve using nonparametric density estimation (kernel density<br />

estimation). The smooth curve is overlaid on the histogram <strong>and</strong> a slider appears beneath the plot. Control<br />

the amount of smoothing by changing the kernel st<strong>and</strong>ard deviation with the slider. The initial Kernel Std<br />

estimate is formed by summing the normal densities of the kernel st<strong>and</strong>ard deviation located at each data<br />

point.<br />

Johnson Su, Johnson Sb, Johnson Sl<br />

The Johnson system of distributions contains three distributions that are all based on a transformed normal<br />

distribution. These three distributions are the Johnson Su, which is unbounded for Y; the Johnson Sb,<br />

which is bounded on both tails (0 < Y < 1); <strong>and</strong> the Johnson Sl, leading to the lognormal family of<br />

distributions.<br />

Note: The S refers to system, the subscript of the range. Although we implement a different method,<br />

information about selection criteria for a particular Johnson system can be found in Slifker <strong>and</strong> Shapiro<br />

(1980).<br />

Johnson distributions are popular because of their flexibility. In particular, the Johnson distribution system<br />

is noted for its data-fitting capabilities because it supports every possible combination of skewness <strong>and</strong><br />

kurtosis.<br />

If Z is a st<strong>and</strong>ard normal variate, then the system is defined as follows:<br />

Z = γ+<br />

δfy ( )

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!