25.11.2014 Views

Biostatistics

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

510 CHAPTER 10 MULTIPLE REGRESSION AND CORRELATION<br />

10.5.5 Refer to Exercise 10.3.5 and let x 1j ¼ 90 and x 2j ¼ 80:<br />

10.5.6 Refer to Exercise 10.3.6 and let x 1j ¼ 50; x 2j ¼ 95:0; x 3j ¼ 2:00; x 4j ¼ 6:00; x 5j ¼ 75, and<br />

x 6j ¼ 70:<br />

10.6 THE MULTIPLE CORRELATION MODEL<br />

We pointed out in the preceding chapter that while regression analysis is concerned with<br />

the form of the relationship between variables, the objective of correlation analysis is to<br />

gain insight into the strength of the relationship. This is also true in the multivariable case,<br />

and in this section we investigate methods for measuring the strength of the relationship<br />

among several variables. First, however, let us define the model and assumptions on which<br />

our analysis rests.<br />

The Model Equation<br />

We may write the correlation model as<br />

y j ¼ b 0 þ b 1 x 1j þ b 2 x 2j þþb k x kj þ e j (10.6.1)<br />

where y j is a typical value from the population of values of the variable Y, the b’s are the<br />

regression coefficients defined in Section 10.2, and the x ij are particular (known) values of<br />

the random variables X i . This model is similar to the multiple regression model, but there is<br />

one important distinction. In the multiple regression model, given in Equation 10.2.1, the X i<br />

are nonrandom variables, but in the multiple correlation model the X i are random variables.<br />

In other words, in the correlation model there is a joint distribution of Y and the X i that we<br />

call a multivariate distribution. Under this model, the variables are no longer thought of as<br />

being dependent or independent, since logically they are interchangeable and either of the<br />

X i may play the role of Y.<br />

Typically, random samples of units of association are drawn from a population of<br />

interest, and measurements of Y and the X i are made.<br />

A least-squares plane or hyperplane is fitted to the sample data by methods described<br />

in Section 10.3, and the same uses may be made of the resulting equation. Inferences may<br />

be made about the population from which the sample was drawn if it can be assumed that<br />

the underlying distribution is normal, that is, if it can be assumed that the joint distribution<br />

of Yand X i is a multivariate normal distribution. In addition, sample measures of the degree<br />

of the relationship among the variables may be computed and, under the assumption that<br />

sampling is from a multivariate normal distribution, the corresponding parameters may be<br />

estimated by means of confidence intervals, and hypothesis tests may be carried out.<br />

Specifically, we may compute an estimate of the multiple correlation coefficient that<br />

measures the dependence between Y and the X i . This is a straightforward extension of the<br />

concept of correlation between two variables that we discuss in Chapter 9. We may also<br />

compute partial correlation coefficients that measure the intensity of the relationship<br />

between any two variables when the influence of all other variables has been removed.<br />

The Multiple Correlation Coefficient As a first step in analyzing the<br />

relationships among the variables, we look at the multiple correlation coefficient.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!