27.06.2013 Views

Evolution and Optimum Seeking

Evolution and Optimum Seeking

Evolution and Optimum Seeking

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

108 <strong>Evolution</strong> Strategies for Numerical Optimization<br />

The question remains of how tochoose the r<strong>and</strong>om vectors z (g) .Thischoice has the<br />

r^ole of mutation. Mutations are understood nowadays to be r<strong>and</strong>om, purposeless events,<br />

which furthermore only occur very rarely. Ifoneinterprets them, as is done here, as a sum<br />

of many individual events, it is natural to choose a probability distribution according to<br />

which small changes occur frequently, but large ones only rarely (the central limit theorem<br />

of statistics). For discrete variations one can use a binomial distribution, for example, for<br />

continuous variations a Gaussian or normal distribution.<br />

Two requirements then arise together by analogy with natural evolution:<br />

That the expectation value i for a component zi has the value zero<br />

2<br />

That the variance i ,theaverage squared deviation from the mean, is small<br />

The probability density function for normally distributed r<strong>and</strong>om events is (e.g., Heinhold<br />

<strong>and</strong> Gaede, 1972):<br />

w(zi) =<br />

1<br />

p<br />

2 i<br />

exp<br />

!<br />

(5.1)<br />

; (zi ; i) 2<br />

2 2 i<br />

2<br />

If i = 0, one obtains a so-called (0 i ) normal distribution. There are still however a<br />

total of n free parameters f i i= 1(1)ng with which to specify the st<strong>and</strong>ard deviations<br />

of the individual r<strong>and</strong>om components. By analogy with other, deterministic search strategies,<br />

the i can be called step lengths, in the sense that they represent average values of<br />

the lengths of the r<strong>and</strong>om steps.<br />

For the occurrence of a particular r<strong>and</strong>om vector z = fzi i = 1(1)ng, with the<br />

2<br />

independent (0 i ) distributed components zi, the probability density function is<br />

w(z1z2:::zn) =<br />

nY<br />

i=1<br />

w(zi) =<br />

(2 ) n 2<br />

or more compactly, if i = for all i =1(1)n,<br />

w(z) =<br />

1<br />

p2<br />

! n<br />

1<br />

nQ<br />

i=1<br />

exp<br />

i<br />

exp<br />

;zz T<br />

2 2<br />

!<br />

; 1<br />

2<br />

nX<br />

i=1<br />

zi<br />

i<br />

2 !<br />

(5.2)<br />

(5.3)<br />

q Pn<br />

i=1 z 2 i a p 2 distribution is ob-<br />

For the length of the overall r<strong>and</strong>om vector S =<br />

2 tained. The distribution with n degrees of freedom approximates, for large n, to<br />

a( q n ; 1<br />

2 2<br />

) normal distribution. Thus the expectation value for the total length<br />

2<br />

of the r<strong>and</strong>om vector for many variables is E(S) = p n,thevariance is D2 (S) =<br />

E((S ; E(S)) 2 )= 2<br />

, <strong>and</strong> the coe cient ofvariation is<br />

2<br />

D(S)<br />

E(S)<br />

= 1<br />

p 2 n<br />

This means that the most probable value for the length of the r<strong>and</strong>om vector at constant<br />

increases as the square root of the number of variables <strong>and</strong> the relative width of variation<br />

decreases with the reciprocal square root of parameters.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!