14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 9 Performing Nonlinear Regression 273<br />

Statistical Details<br />

The nonlinear minimization formula works by taking the first two derivatives of<br />

model, <strong>and</strong> forming the gradient <strong>and</strong> an approximate Hessian as follows:<br />

n<br />

L<br />

=<br />

<br />

ρ( f( β)<br />

)<br />

with respect to the<br />

i = 1<br />

n<br />

∂L<br />

-------<br />

∂<br />

--------------------<br />

ρ( f( β)<br />

) ∂f<br />

=<br />

∂β -------<br />

j ∂f ∂β<br />

i = 1<br />

j<br />

2 n<br />

∂ L ∂ 2 2<br />

---------------------------<br />

ρ( f ( β)<br />

)<br />

∂β j<br />

∂β k<br />

( ∂f ) 2 -------<br />

∂f<br />

--------<br />

∂f ∂<br />

∂β j ∂β ---------------------- ρ( f ( β)<br />

) ∂ f<br />

= <br />

+<br />

k ∂f ∂ βk ∂ β j<br />

i = 1<br />

If f (•)<br />

is linear in the parameters, the second term in the last equation is zero. If not, you can still hope that<br />

its sum is small relative to the first term, <strong>and</strong> use<br />

2 n<br />

∂ L ∂ 2 -------------------------<br />

ρ( f ( β)<br />

) ∂f<br />

-------<br />

∂f<br />

≅<br />

∂β j<br />

∂β <br />

--------<br />

k ( ∂f ) 2 ∂β j<br />

∂β<br />

i = 1<br />

k<br />

The second term is probably small if ρ is the squared residual because the sum of residuals is small. The<br />

term is zero if there is an intercept term. For least squares, this is the term that distinguishes Gauss-Newton<br />

from Newton-Raphson. In JMP, the second term is calculated only if the Second Derivative option is<br />

checked.<br />

Note: The st<strong>and</strong>ard errors, confidence intervals, <strong>and</strong> hypothesis tests are correct only if least squares<br />

estimation is done, or if maximum likelihood estimation is used with a proper negative log likelihood.<br />

ρ (•)<br />

Notes Concerning Derivatives<br />

The nonlinear platform takes symbolic derivatives for formulas with most common operations. This section<br />

shows what type of derivative expressions result.<br />

If you open the Negative Exponential.jmp nonlinear sample data example, the actual formula looks<br />

something like this:<br />

Parameter({b0=0.5, b1=0.5,}b0*(1-Exp(-b1*X)))<br />

The Parameter block in the formula is hidden if you use the formula editor. That is how it is stored in the<br />

column <strong>and</strong> how it appears in the Nonlinear Launch dialog. Two parameters named b0 <strong>and</strong> b1 are given<br />

initial values <strong>and</strong> used in the formula to be fit.<br />

The Nonlinear platform makes a separate copy of the formula, <strong>and</strong> edits it to extract the parameters from<br />

the expression. Then it maps the references to them to the place where they are estimated. Nonlinear takes<br />

the analytic derivatives of the prediction formula with respect to the parameters. If you use the Show<br />

Derivatives comm<strong>and</strong>, you get the resulting formulas listed in the log, like this:

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!