An Introduction to Genetic Algorithms - Boente

More documents

Recommendations

Info

prone to rather arbitrary orderings. Many−Character and Real−Valued Encodings For many applications, it is most natural to use an alphabet of many characters or real numbers to form chromosomes. Examples include Kitano's many−character representation for graph−generation grammars, Meyer and Packard's real−valued representation for condition sets, Montana and Davis's real−valued representation for neural−network weights, and Schultz−Kremer's real−valued representation for torsion angles in proteins. Holland's schema−counting argument seems to imply that GAs should exhibit worse performance on multiple−character encodings than on binary encodings. However, this has been questioned by some (see, e.g., Antonisse 1989). Several empirical comparisons between binary encodings and multiple−character or real−valued encodings have shown better performance for the latter (see, e.g., Janikow and Michalewicz 1991; Wright 1991). But the performance depends very much on the problem and the details of the GA being used, and at present there are no rigorous guidelines for predicting which encoding will work best. Tree Encodings Chapter 4: Theoretical Foundations of Genetic Algorithms Tree encoding schemes, such as John Koza's scheme for representing computer programs, have several advantages, including the fact that they allow the search space to be open−ended (in principle, any size tree could be formed via crossover and mutation). This open−endedness also leads to some potential pitfalls. The trees can grow large in uncontrolled ways, preventing the formation of more structured, hierarchical candidate solutions. (Koza's (1992, 1994) "automatic definition of functions" is one way in which GP can be encouraged to design hierarchically structured programs.) Also, the resulting trees, being large, can be very difficult to understand and to simplify. Systematic experiments evaluating the usefulness of tree encodings and comparing them with other encodings are only just beginning in the genetic programming community. Likewise, as yet there are only very nascent attempts at extending GA theory to tree encodings (see, e.g., Tackett 1994; O'Reilly and Oppacher 1995). These are only the most common encodings; a survey of the GA literature will turn up experiments on several others. How is one to decide on the correct encoding for one's problem? Lawrence Davis, a researcher with much experience applying GAs to realworld problems, strongly advocates using whatever encoding is the most natural for your problem, and then devising a GA that can use that encoding (Davis 1991). Until the theory of GAs and encodings is better formulated, this might be the best philosophy; as can be seen from the examples presented in this book, most research is currently done by guessing at an appropriate encoding and then trying out a particular version of the GA on it. This is not much different from other areas of machine learning; for example, encoding a learning problem for a neural net is typically done by trial and error. One appealing idea is to have the encoding itself adapt so that the GA can make better use of it. 5.3 ADAPTING THE ENCODING Choosing a fixed encoding ahead of time presents a paradox to the potential GA user: for any problem that is hard enough that one would want to use a GA, one doesn't know enough about the problem ahead of time to 118
come up with the best encoding for the GA. In fact, coming up with the best encoding is almost tantamount to solving the problem itself! An example of this was seen in the discussion on evolving cellular automata in chapter 2 above. The original lexicographic ordering of bits was arbitrary, and it probably impeded the GA from finding better solutions quickly—to find high−fitness rules, many bits spread throughout the string had to be coadapted. If these bits were close together on the string, so that they were less likely to be separated under crossover, the performance of the GA would presumably be improved. But we had no idea how best to order the bits ahead of time for this problem. This is known in the GA literature as the "linkage problem"—one wants to have functionally related loci be more likely to stay together on the string under crossover, but it is not clear how this is to be done without knowing ahead of time which loci are important in useful schemas. Faced with this problem, and having notions of evolution and adaptation already primed in the mind, many users have a revelation: "As long as I'm using a GA to solve the problem, why not have it adapt the encoding at the same time!" A second reason for adapting the encoding is that a fixed−length representation limits the complexity of the candidate solutions. For example, in the Prisoner's Dilemma example, Axelrod fixed the memory of the evolving strategies to three games, requiring a chromosome of length 64 plus a few extra bits to encode initial conditions. But it would be interesting to know what types of strategies could evolve if the memory size were allowed to increase or decrease (requiring variable−length chromosomes). As was mentioned earlier, such an experiment was done by Lindgren (1992), in which "gene doubling" and "deletion" operators allowed the chromosome length—and thus the potential memory size—to increase and decrease over time, permitting more "open−ended" evolution. Likewise, tree encodings such as those used in genetic programming automatically allow for adaptation of the encoding, since under crossover and mutation the trees can grow or shrink. Meyer and Packard's encoding of condition sets also allowed for individuals of varying lengths, since crossovers between individuals of different lengths could cause the number of conditions in a set to increase or decrease. Other work along these lines has been done by Schaefer (1987), Harp and Samad (1991), Harvey (1992), Schraudolph and Belew (1992), and Altenberg (1994). Below I describe in detail three (of the many) approaches to adapting the encoding for a GA. Inversion Chapter 4: Theoretical Foundations of Genetic Algorithms Holland (1975) included proposals for adapting the encodings in his original proposal for GAs (also see Goldberg (1989a). Holland, acutely aware that correct linkage is essential for single−point crossover to work well, proposed an "inversion" operator specifically to deal with the linkage problem in fixed−length strings. Inversion is a reordering operator inspired by a similar operator in real genetics. Unlike simple GAs, in real genetics the function of a gene is often independent of its position in the chromosome (though often genes in a local area work together in a regulatory network), so inverting part of the chromosome will retain much or all of the "semantics" of the original chromosome. To use inversion in GAs, we have to find some way for the functional interpretation of an allele to be the same no matter where it appears in the string. For example, in the chromosome encoding a cellular automaton (see section 2.1), the leftmost bit under lexicographic ordering is the output bit for the neighborhood of all zeros. We would want that bit to represent that same neighborhood even if its position were changed in the string under an inversion. Holland proposed that each allele be given an index indicating its "real" position, to be used when evaluating a chromosome's fitness. For example, the string 00010101 would be encoded as with the first member of each pair giving the "real" position of the given allele. This is the same string as, say, 119
Page 2 and 3:
An Introduction to Genetic Algorith
Page 4 and 5:
Table of Contents Chapter 4: Theore
Page 6 and 7:
Chapter 1: Genetic Algorithms: An O
Page 8 and 9:
Page 10 and 11:
1.4 SEARCH SPACES AND FITNESS LANDS
Page 12 and 13:
potential energy is a measure of ho
Page 14 and 15:
A simple method of implementing fit
Page 16 and 17:
from an initial state to a goal. Fo
Page 18 and 19:
Robert Axelrod of the University of
Page 20 and 21:
Page 22 and 23:
Page 24 and 25:
Page 26 and 27:
instances of H at time t, and let
Page 28 and 29:
What is the total payoff after 10 g
Page 30 and 31:
6. * c. d. a. b. Chapter 1: Genetic
Page 32 and 33:
As a simple example, consider a pro
Page 34 and 35:
Chapter 2: Genetic Algorithms in Pr
Page 36 and 37:
it to get one fitness case correct:
Page 38 and 39:
Evolving Cellular Automata Chapter
Page 40 and 41:
Page 42 and 43:
up most of the computation time. Ch
Page 44 and 45:
Page 46 and 47:
the prospect of using GAs to automa
Page 48 and 49:
where Ã is the standard deviation
Page 50 and 51:
Page 52 and 53:
Page 54 and 55:
the brain. In a feedforward network
Page 56 and 57:
Page 58 and 59:
Figure 2.21: An illustration of Mil
Page 60 and 61:
Page 62 and 63:
experiments with grammatical encodi
Page 64 and 65:
function of the average error of th
Page 66 and 67:
COMPUTER EXERCISES 1. 2. 3. 4. 5. 6
Page 68 and 69:
Page 70 and 71:
simulated generations, and such sim
Page 72 and 73: environments, they can fairly quick
Page 74 and 75: Chapter 3: Genetic Algorithms in Sc
Page 76 and 77: Figure 3.6: A schematic illustratio
Page 82 and 83: an initial population in which the
Page 88 and 89: these random effects, Bedau and Pac
Page 90 and 91: 3. * 4. * the female child in the n
Page 92 and 93: calculating the observed average fi
Page 94 and 95: (4.1) The goal is to find n = n* th
Page 96 and 97: competition in the d *···* parti
Page 98 and 99: schemas with the best observed fitn
Page 100 and 101: Steepest−ascent hill climbing (SA
Page 102 and 103: (If the algorithm spends only 1/m o
Page 104 and 105: strings—then in principle crossov
Page 106 and 107: (4.7) Chapter 4: Theoretical Founda
Page 108 and 109: 2. 3. Calculate the fitness f(x) of
Page 110 and 111: Defining ri,j(k) and is somewhat tr
Page 112 and 113: Results of the Formalization How ca
Page 114 and 115: Chapter 4: Theoretical Foundations
Page 118 and 119: Figure 4.5: Predicted and observed
Page 120 and 121: COMPUTER EXERCISES 1. 2. 3. 4. 5. 6
Page 124 and 125: Inversion works by choosing two poi
Page 128 and 129: The one hitch is that, as was seen
Page 130 and 131: used in the work of Tanese (1989),
Page 132 and 133: Two individuals are chosen at rando
Page 136 and 137: strategies community, in which para
Page 138 and 139: * 10. * 11. * 12. * Chapter 4: Theo
Page 140 and 141: Chapter 6: Conclusions and Future D
Page 142 and 143: Chapter 6: Conclusions and Future D
Page 144 and 145: Appendix A: Selected General Refere
Page 146 and 147: Evolution Artificielle Foundations
Page 148 and 149: Kaufmann. Appendix B: Other Resourc
Page 150 and 151: Morgan Kaufmann. Appendix B: Other
Page 152 and 153: Appendix B: Other Resources Forrest
Page 154 and 155: Appendix B: Other Resources Grefens
Page 156 and 157: Kirkpatrick, S., Gelatt, C.D., Jr.,
Page 158 and 159: Appendix B: Other Resources Mitchel
Page 160 and 161: Appendix B: Other Resources Roughga
Page 162: Appendix B: Other Resources Vose, M
show all

An Introduction to Genetic Algorithms - Boente

Create successful ePaper yourself

Delete template?

Save as template?