14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Appendix A Statistical Details 653<br />

The Response Models<br />

The Response Models<br />

JMP fits linear models to three different kinds of responses: continuous, nominal, <strong>and</strong> ordinal. The models<br />

<strong>and</strong> methods available in JMP are practical, are widely used, <strong>and</strong> suit the need for a general approach in a<br />

statistical software tool. As with all statistical software, you are responsible for learning the assumptions of<br />

the models you choose to use, <strong>and</strong> the consequences if the assumptions are not met. For more information<br />

see “The Usual Assumptions” on page 672 in this chapter.<br />

Continuous Responses<br />

When the response column (column assigned the Y role) is continuous, JMP fits the value of the response<br />

directly. The basic model is that for each observation,<br />

Y = (some function of the X’s <strong>and</strong> parameters) + error<br />

Statistical tests are based on the assumption that the error term in the model is normally distributed.<br />

Fitting Principle for Continuous Response<br />

Base Model<br />

The Fitting principle is called least squares. The least squares method estimates the parameters in the model<br />

to minimize the sum of squared errors. The errors in the fitted model, called residuals, are the difference<br />

between the actual value of each observation <strong>and</strong> the value predicted by the fitted model.<br />

The least squares method is equivalent to the maximum likelihood method of estimation if the errors have a<br />

normal distribution. This means that the analysis estimates the model that gives the most likely residuals.<br />

The log-likelihood is a scale multiple of the sum of squared errors for the normal distribution.<br />

The simplest model for continuous measurement fits just one value to predict all the response values. This<br />

value is the estimate of the mean. The mean is just the arithmetic average of the response values. All other<br />

models are compared to this base model.<br />

Nominal Responses<br />

Nominal responses are analyzed with a straightforward extension of the logit model. For a binary (two-level)<br />

response, a logit response model is<br />

which can be written<br />

where<br />

log--------------------<br />

P( y = 1)<br />

<br />

<br />

P( y = 2)<br />

<br />

=<br />

Xβ<br />

P( y = 1) = F( Xβ)<br />

Fx ( )<br />

is the cumulative distribution function of the st<strong>and</strong>ard logistic distribution

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!