14.03.2014 Views

Basic Analysis and Graphing - SAS

Basic Analysis and Graphing - SAS

Basic Analysis and Graphing - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 2 Performing Univariate <strong>Analysis</strong> 69<br />

Statistical Details for the Distribution Platform<br />

Statistical Details for Quantiles<br />

This section describes how quantiles are computed.<br />

To compute the pth quantile of N non-missing values in a column, arrange the N values in ascending order<br />

<strong>and</strong> call these column values y 1 , y 2 ,...,y N . Compute the rank number for the pth quantile as p /100(N +1).<br />

• If the result is an integer, the pth quantile is that rank’s corresponding value.<br />

• If the result is not an integer, the pth quantile is found by interpolation. The pth quantile, denoted q p , is<br />

computed as follows:<br />

where:<br />

– n is the number of non-missing values for a variable<br />

– y 1 , y 2, ..., y n represents the ordered values of the variable<br />

– y n+1 is taken to be y n<br />

– i is the integer part <strong>and</strong> f is the fractional part of (n+1)p.<br />

– (n + 1)p = i + f<br />

For example, suppose a data table has 15 rows <strong>and</strong> you want to find the 75th <strong>and</strong> 90th quantile values of a<br />

continuous column. After the column is arranged in ascending order, the ranks that contain these quantiles<br />

are computed as follows:<br />

--------<br />

75<br />

( 15 + 1) = 12 <strong>and</strong> --------<br />

90<br />

( 15 + 1) = 14.4<br />

100<br />

100<br />

The value y 12 is the 75th quantile. The 90th quantile is interpolated by computing a weighted average of the<br />

14th <strong>and</strong> 15th ranked values as y 90 =0.6y 14 +0.4y 15 .<br />

Statistical Details for Summary Statistics<br />

Mean<br />

Std Dev<br />

q p<br />

= ( 1 – f)y i<br />

+ ()y f i + 1<br />

This section contains statistical details for specific statistics in the Summary Statistics report.<br />

The mean is the sum of the non-missing values divided by the number of non-missing values. If you<br />

assigned a Weight or Freq variable, the mean is computed by JMP as follows:<br />

1. Each column value is multiplied by its corresponding weight or frequency.<br />

2. These values are added <strong>and</strong> divided by the sum of the weights or frequencies.<br />

The st<strong>and</strong>ard deviation measures the spread of a distribution around the mean. It is often denoted as s <strong>and</strong> is<br />

the square root of the sample variance, denoted s 2 .

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!