28.02.2014 Views

The Development of Neural Network Based System Identification ...

The Development of Neural Network Based System Identification ...

The Development of Neural Network Based System Identification ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

104 CHAPTER 4 NEURAL NETWORK BASED SYSTEM IDENTIFICATION<br />

minimum <strong>of</strong> the error criterion by iteratively solving the following update equation:<br />

[<br />

−1<br />

θ (i+1) = θ (i) − R (i) (θ) + λ I] (i) G (i) (θ) (4.34)<br />

where I is the identity matrix with a size equal to Hessian R (θ) matrix and the λ (i)<br />

constant is a damping factor used for deciding the step size. <strong>The</strong> value <strong>of</strong> λ (i) is selected<br />

to be λ (i) ≥ 0. <strong>The</strong> usage <strong>of</strong> the constant λ (i) can also be viewed as a blending factor<br />

between Gauss-Newton (GN) and Steepest Descent (SD) update, which provides LM<br />

algorithm with fast convergence speed <strong>of</strong> the GN algorithm and the stability <strong>of</strong> SD<br />

method [Yu and Wilamowski, 2011]. <strong>The</strong> largest update can be possibly achieved by<br />

choosing λ (i) = 0 which gives a full GN step. Shorter step length as in SD algorithm<br />

is achieved by taking λ (i) → ∞ which will cause the diagonal elements <strong>of</strong> R (i) (θ) to<br />

dominate [Ngia and Sjoberg, 2000].<br />

In order to determine λ, the indirect method used in Norgaard [2000] and Fletcher<br />

[1981] is adopted by calculating the following ratio to determine the accuracy <strong>of</strong><br />

approximation:<br />

r (i) = 2 [ (<br />

V N θ (i) ) (<br />

, Z N − VN θ (i) + f (i) )]<br />

, Z N<br />

(f (i)) T (<br />

G θ (i)) (<br />

+ f (i)) T<br />

f (i) λ (i)<br />

} {{ }<br />

reduction approximation<br />

(4.35)<br />

<strong>The</strong> main purpose <strong>of</strong> introducing the ratio calculation is to measure how well the<br />

reduction <strong>of</strong> the criterion V N (θ, Z N ) matches the reduction predicted by approximation<br />

terms in denominator <strong>of</strong> ratio calculation in Equation (4.35). <strong>The</strong> damping factor λ is<br />

adjusted accordingly to the ratio r (i) by some factor [Norgaard, 2000]. <strong>The</strong> procedure<br />

for the LM algorithm using indirect method to determine λ is given in Figure 4.11. <strong>The</strong><br />

reduction approximation (denominator term in Equation (4.35)) is most likely a close<br />

approximation to error criterion V N (θ, Z N ), if the ratio r (i) value is close to one and<br />

parameter λ should be reduced by some factor. However, if the ratio r (i) is small or<br />

a negative value, parameter λ should be increased. Additional stopping criteria are<br />

normally introduced to this algorithm to prevent minimisation problems or to force<br />

early stopping such as stopping criteria based on maximum number <strong>of</strong> iterations, sum<br />

<strong>of</strong> square error that drops below a certain threshold, upper bound for gradient and

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!