An Introduction to Genetic Algorithms - Boente

More documents

Recommendations

Info

(4.1) The goal is to find n = n* that minimizes L(N n, n). This can be done by taking the derivative of L(N n, n) with respect to n, setting it to zero, and solving for n: (4.2) To solve this equation, we need to express q in terms of n so that we can find dq/ dn. Recall that qis the probability that A1(N, n) = A1. Suppose A1(N, n) indeed is A1; then A1 was given ntrials. Let S n 1be the sum of the payoffs of the ntrials given to A1, and let s Nn 2 be the sum of the payoffs of the N n trials given to A 2. Then (4.3) that is, the probability that the observed average payoff of A2 is higher than that of A1. Equivalently, (4.4) Chapter 4: Theoretical Foundations of Genetic Algorithms (For subtle reasons, this is actually only an approximation; see Holland 1975, chapter 5.) Since S n 1 and S Nn 2 are random variables, their difference is also a random variable with a well−defined distribution. Pr((S n 1/n S Nn 2/(N n)) < 0) is simply the area under the part of the distribution that is less than zero. The problem now is to compute this area—a tricky task. Holland originally approximated it by using the central limit theorem to assume a normal distribution. Dan Frantz (as described in chapter 10 of the second edition of Holland 1975) corrected the original approximation using the theory of large deviations rather than the central limit theorem. Here the mathematics get complicated (as is often the case for easy−to−state problems such as the Two−Armed Bandit problem). According to Frantz, the optimal allocation of trials n * to the observed second best of the two random variables corresponding to the Two−Armed Bandit problem is approximated by where c1,c2, and c3 are positive constants defined by Frantz. (Here In denotes the natural logarithm.) The details of this solution are of less concern to us than its form. This can be seen by rearranging the terms and performing some algebra to get an expression for N n *, the optimal allocation of trials to the observed better arm: As n* increases, e n */ 2c 1dominates everything else, so we can further approximate (letting c = 1/2 c1): 90
In short, the optimal allocation of trials N n* to the observed better arm should increase exponentially with the number of trials to the observed worse arm. Interpretation of the Solution The Two−Armed Bandit problem is a simple model of the general problem of how to allocate resources in the face of uncertainty. This is the "exploration versus exploitation" problem faced by an adaptive system. The Schema Theorem suggests that, given a number of assumptions, the GA roughly adopts a version of the optimal strategy described above:over time, the number of trials allocated to the best observed schemas in the population increases exponentially with respect to the number of trials allocated to worse observed schemas. The GA implements this search strategy via implicit parallelism, where each of the nindividuals in population can be viewed as a sample of 2 l different schemas. The number of instances of a given schema H in the population at any time is related to its observed average performance, giving (under some conditions) an exponential growth rate for highly fit schemas. However, the correct interpretation of the Two−Armed Bandit analogy for schemas is not quite so simple. Grefenstette and Baker (1989) illustrate this with the following fitness function: (4.5) Chapter 4: Theoretical Foundations of Genetic Algorithms (Recall that "x Î H" denotes "x is an instance of schema H.") Let u(H) be the "static" average fitness of a schema H (the average over all instances of the schema in the search space), and let Û(H, t) be the observed average fitness of H at time t (the average fitness of instances of H in the population at time t). It is easy to show that u(1 *···* = ½ and u(0 *···*) = 1. But under a GA, via selection, 1 *···* will dominate the population very quickly in the form of instances of 111 *···* since instances of the latter will be strongly selected in the population. This means that, under a GA, Û(1 *···*, t) H 2 after a small number of time steps, and 1 *···*will receive many more samples than 0 *···* even though its static average fitness is lower. The problem here is that in the Two−Armed Bandit each arm is an independent random variable with a fixed distribution, so the likelihood of a particular outcome does not change from play to play. But in the GA different "arms" (schemas) interact; the observed payoff for 111 *···* has a strong (if not determining) effect on the observed payoff for 1 *···*. Unlike in the Two−Armed Bandit problem, the additional trials to 1 *···* will not provide additional information about its true payoff rate, since they all end up being trials to 111 *···*. In short, the GA cannot be said to be sampling schemas independently to estimate their true payoffs. Grefenstette and Baker's example shows that the GA does not play a 3 L −armed bandit with all 3 L possible schemas competing as arms. A more correct interpretation (John Holland, personal communication) is that the GA plays a 2 k −armed bandit in each order−k "schema partition," defined as a division of the search space into 2 k directly competing schemas. For example, the partition d *···* consists of the two schemas 0 *···* and 1 *···*. Likewise, the partition *d *d *···* consists of the four schemas *0*0*···*,*0*1*···*,*1*0*···*, and *1*1*···*. The idea is that the best observed schema within a partition will receive exponentially more samples than the next best, and so on. Furthermore, the GA will be close to an optimal 2 k −armed bandit strategy only for partitions in which the current population's distribution of fitnesses in the competing schemas is reasonably uniform (Holland, personal communication cited in Grefenstette 1991b). Thus, the schema 91
Page 2 and 3:
An Introduction to Genetic Algorith
Page 4 and 5:
Table of Contents Chapter 4: Theore
Page 6 and 7:
Chapter 1: Genetic Algorithms: An O
Page 8 and 9:
Page 10 and 11:
1.4 SEARCH SPACES AND FITNESS LANDS
Page 12 and 13:
potential energy is a measure of ho
Page 14 and 15:
A simple method of implementing fit
Page 16 and 17:
from an initial state to a goal. Fo
Page 18 and 19:
Robert Axelrod of the University of
Page 20 and 21:
Page 22 and 23:
Page 24 and 25:
Page 26 and 27:
instances of H at time t, and let
Page 28 and 29:
What is the total payoff after 10 g
Page 30 and 31:
6. * c. d. a. b. Chapter 1: Genetic
Page 32 and 33:
As a simple example, consider a pro
Page 34 and 35:
Chapter 2: Genetic Algorithms in Pr
Page 36 and 37:
it to get one fitness case correct:
Page 38 and 39:
Evolving Cellular Automata Chapter
Page 40 and 41:
Chapter 2: Genetic Algorithms in Pr
Page 42 and 43:
up most of the computation time. Ch
Page 44 and 45: Chapter 2: Genetic Algorithms in Pr
Page 46 and 47: the prospect of using GAs to automa
Page 48 and 49: where Ã is the standard deviation
Page 54 and 55: the brain. In a feedforward network
Page 58 and 59: Figure 2.21: An illustration of Mil
Page 62 and 63: experiments with grammatical encodi
Page 64 and 65: function of the average error of th
Page 66 and 67: COMPUTER EXERCISES 1. 2. 3. 4. 5. 6
Page 70 and 71: simulated generations, and such sim
Page 72 and 73: environments, they can fairly quick
Page 74 and 75: Chapter 3: Genetic Algorithms in Sc
Page 76 and 77: Figure 3.6: A schematic illustratio
Page 82 and 83: an initial population in which the
Page 88 and 89: these random effects, Bedau and Pac
Page 90 and 91: 3. * 4. * the female child in the n
Page 92 and 93: calculating the observed average fi
Page 96 and 97: competition in the d *···* parti
Page 98 and 99: schemas with the best observed fitn
Page 100 and 101: Steepest−ascent hill climbing (SA
Page 102 and 103: (If the algorithm spends only 1/m o
Page 104 and 105: strings—then in principle crossov
Page 106 and 107: (4.7) Chapter 4: Theoretical Founda
Page 108 and 109: 2. 3. Calculate the fitness f(x) of
Page 110 and 111: Defining ri,j(k) and is somewhat tr
Page 112 and 113: Results of the Formalization How ca
Page 114 and 115: Chapter 4: Theoretical Foundations
Page 118 and 119: Figure 4.5: Predicted and observed
Page 120 and 121: COMPUTER EXERCISES 1. 2. 3. 4. 5. 6
Page 122 and 123: prone to rather arbitrary orderings
Page 124 and 125: Inversion works by choosing two poi
Page 128 and 129: The one hitch is that, as was seen
Page 130 and 131: used in the work of Tanese (1989),
Page 132 and 133: Two individuals are chosen at rando
Page 136 and 137: strategies community, in which para
Page 138 and 139: * 10. * 11. * 12. * Chapter 4: Theo
Page 140 and 141: Chapter 6: Conclusions and Future D
Page 142 and 143: Chapter 6: Conclusions and Future D
Page 144 and 145:
Appendix A: Selected General Refere
Page 146 and 147:
Evolution Artificielle Foundations
Page 148 and 149:
Kaufmann. Appendix B: Other Resourc
Page 150 and 151:
Morgan Kaufmann. Appendix B: Other
Page 152 and 153:
Appendix B: Other Resources Forrest
Page 154 and 155:
Appendix B: Other Resources Grefens
Page 156 and 157:
Kirkpatrick, S., Gelatt, C.D., Jr.,
Page 158 and 159:
Appendix B: Other Resources Mitchel
Page 160 and 161:
Appendix B: Other Resources Roughga
Page 162:
Appendix B: Other Resources Vose, M
show all

An Introduction to Genetic Algorithms - Boente

Create successful ePaper yourself

Delete template?

Save as template?