14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

654 Statistical Details Appendix A<br />

The Response Models<br />

Fx ( )<br />

1<br />

= ----------------<br />

1 + e – x =<br />

e x<br />

-------------<br />

1 + e x<br />

For r response levels, JMP fits the probabilities that the response is one of r different response levels given by<br />

the data values. The probability estimates must all be positive. For a given configuration of X’s, the<br />

probability estimates must sum to 1 over the response levels. The function that JMP uses to predict<br />

probabilities is a composition of a linear model <strong>and</strong> a multi-response logistic function. This is sometimes<br />

called a log-linear model because the logs of ratios of probabilities are linear models. JMP relates each<br />

response probability to the rth probability <strong>and</strong> fit a separate set of design parameters to these r – 1 models.<br />

P( y = j)<br />

log-------------------<br />

<br />

P( y = r)<br />

<br />

= Xβ () j forj = 1 , …,<br />

r – 1<br />

Fitting Principle For Nominal Response<br />

Base Model<br />

The fitting principle is called maximum likelihood. It estimates the parameters such that the joint probability<br />

for all the responses given by the data is the greatest obtainable by the model. Rather than reporting the<br />

joint probability (likelihood) directly, it is more manageable to report the total of the negative logs of the<br />

likelihood.<br />

The uncertainty (–log-likelihood) is the sum of the negative logs of the probabilities attributed by the model<br />

to the responses that actually occurred in the sample data. For a sample of size n, it is often denoted as H <strong>and</strong><br />

written<br />

n<br />

H = –log( P( y = y i<br />

))<br />

i = 1<br />

If you attribute a probability of 1 to each event that did occur, then the sum of the negative logs is zero for a<br />

perfect fit.<br />

The nominal model can take a lot of time <strong>and</strong> memory to fit, especially if there are many response levels.<br />

JMP tracks the progress of its calculations with an iteration history, which shows the –log-likelihood values<br />

becoming smaller as they converge to the estimates.<br />

The simplest model for a nominal response is a set of constant response probabilities fitted as the occurrence<br />

rates for each response level across the whole data table. In other words, the probability that y is response<br />

level j is estimated by dividing the total sample count n into the total of each response level nj, <strong>and</strong> is written<br />

n j<br />

p j<br />

= ---<br />

n<br />

All other models are compared to this base model. The base model serves the same role for a nominal<br />

response as the sample mean does for continuous models.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!