28.12.2013 Views

preface to fifteenth edition

preface to fifteenth edition

preface to fifteenth edition

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2.128 SECTION 2<br />

Example 10 Six samples from a bulk chemical shipment averaged 77.50% active ingredient<br />

with s 1.45%. The manufacturer claimed 80.00%. Can this claim be supported?<br />

A one-tailed test is required since the alternative hypothesis states that the population parameter<br />

is equal <strong>to</strong> or less than the hypothesized value.<br />

77.50 80.00<br />

t p6 1 3.86<br />

1.45<br />

Since t 0.95 2.01, and t 0.99 3.36, the hypothesis is rejected at both the 0.05 and the 0.01<br />

levels of significance. It is extremely unlikely that the claim is justified.<br />

2.3.8 The Chi-square ( 2 ) Distribution<br />

The 2 distribution describes the behavior of variances. Actually there is not a single 2 distribution<br />

but a whole set of distributions. Each distribution depends upon the number of degrees of freedom<br />

(designated variously as df, d.f., or f ) in that distribution. Table 2.28 is laid out so that the horizontal<br />

axis is labeled with probability levels, while the vertical axis is listed in descending order of increasing<br />

number of degrees of freedom. The entries increase both as you read down and across the<br />

table. Although Table 2.28 does not display the values for the mid-range of the distributions, at the<br />

50% point of each distribution, the expected value of 2 is equal <strong>to</strong> the degrees of freedom. Estimates<br />

of the variance are uncertain when based only on a few degrees of freedom. With the 10 samples<br />

in Example 11, the standard deviation can vary by a large fac<strong>to</strong>r purely by random chance alone.<br />

Even 31 samples gives a spread of standard deviation of 2.6 at the 95% confidence level.<br />

Understanding the 2 distribution allows us <strong>to</strong> calculate the expected values of random variables<br />

that are normally and independently distributed. In least squares multiple regression, or in calibration<br />

work in general, there is a basic assumption that the error in the response variable is random and<br />

normally distributed, with a variance that follows a 2 distribution.<br />

Confidence limits for an estimate of the variance may be calculated as follows. For each group<br />

of samples a standard deviation is calculated. These estimates of possess a distribution called the<br />

2 distribution:<br />

s 2<br />

2<br />

(2.15)<br />

2<br />

/df<br />

The upper and lower confidence limits for the standard deviation are obtained by dividing<br />

(N 1)s 2 by two entries taken from Table 2.28. The estimate of variance at the 90% confidence<br />

limits is for use in the entries<br />

2<br />

and<br />

2<br />

0.05 0.95<br />

(for 5% and 95%) with N degrees of freedom.<br />

Example 11 The variance obtained for 10 samples is (0.65) 2 . 2 is known <strong>to</strong> be (0.75) 2 .How<br />

reliable is s 2 as an estimate of 2 ?<br />

2 2<br />

s (N 1) s (N 1)<br />

2<br />

<br />

2 2<br />

0.975 0.025<br />

(0.65)<br />

2<br />

(10 1) (0.65)<br />

2<br />

(10 1)<br />

2<br />

<br />

19.02 2.70<br />

2<br />

0.20 1.43<br />

Thus, only one time in 40 will 9s 2 / 2 be less than 2.70 by chance alone. Similarly, only one time

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!