
initial pattern on the input units. This would be trivial, except that the number of hidden units is smaller than the number of input units, so some encoding and decoding must be done.

These are all relatively easy problems for multi-layer neural networks to learn to solve under back-propagation. The networks had different numbers of units for different tasks (ranging from 5 units for the XOR task to 20 units for the encoder/decoder task); the goal was to see if the GA could discover a good connection topology for each task. For each run the population size was 50, the crossover rate was 0.6, and the mutation rate was 0.005. In all three tasks, the GA was easily able to find networks that readily learned to map inputs to outputs over the training set with little error. However, the three tasks were too easy to be a rigorous test of this method; it remains to be seen if this method can scale up to more complex tasks that require much larger networks with many more interconnections. I chose the project of Miller, Todd, and Hegde to illustrate this approach because of its simplicity. For several examples of more sophisticated approaches to evolving network architectures using direct encoding, see Whitley and Schaffer 1992.
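To make the direct-encoding setup concrete, here is a minimal sketch in Python. The population size, crossover rate, and mutation rate are the values quoted above; everything else (the bit-string chromosome layout, the stand-in fitness function, and the simple truncation selection) is an assumption for illustration. In the actual experiments, each decoded network was trained by back-propagation and its post-training error determined its fitness.

```python
import random

N_UNITS = 5                     # e.g., the 5-unit XOR network from the text
CHROM_LEN = N_UNITS * N_UNITS   # one bit per potential connection
POP_SIZE, CROSSOVER_RATE, MUTATION_RATE = 50, 0.6, 0.005  # parameters from the text

def decode(chrom):
    # Direct encoding: bit i*N_UNITS + j says whether unit i connects to unit j.
    return [[chrom[i * N_UNITS + j] for j in range(N_UNITS)] for i in range(N_UNITS)]

def fitness(chrom):
    # Placeholder objective. In the real experiments, the decoded network was
    # trained with back-propagation and scored by its error on the task.
    return sum(chrom)

def crossover(p1, p2):
    if random.random() < CROSSOVER_RATE:
        cut = random.randrange(1, CHROM_LEN)
        return p1[:cut] + p2[cut:]
    return p1[:]

def mutate(chrom):
    return [bit ^ 1 if random.random() < MUTATION_RATE else bit for bit in chrom]

population = [[random.randint(0, 1) for _ in range(CHROM_LEN)] for _ in range(POP_SIZE)]
for generation in range(100):
    parents = sorted(population, key=fitness, reverse=True)[: POP_SIZE // 2]
    population = [mutate(crossover(random.choice(parents), random.choice(parents)))
                  for _ in range(POP_SIZE)]

print(decode(max(population, key=fitness)))
```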

Grammatical Encoding


The method of grammatical encoding can be illustrated by the work of Hiroaki Kitano (1990), who points out that direct-encoding approaches become increasingly difficult to use as the size of the desired network increases. As the network's size grows, the size of the required chromosome increases quickly, which leads to problems both in performance (how high a fitness can be obtained) and in efficiency (how long it takes to obtain high fitness). In addition, since direct-encoding methods explicitly represent each connection in the network, repeated or nested structures cannot be represented efficiently, even though these are common for some problems.

The solution pursued by Kitano and others is to encode networks as grammars; the GA evolves the grammars, but the fitness is tested only after a "development" step in which a network develops from the grammar. That is, the "genotype" is a grammar, and the "phenotype" is a network derived from that grammar.

A grammar is a set of rules that can be applied to produce a set of structures (e.g., sentences in a natural language, programs in a computer language, neural network architectures). A simple example is the following grammar:

S → aSb
S → ε

Here S is the start symbol and a nonterminal, a and b are terminals, and ε is the empty-string terminal. (S → ε means that S can be replaced by the empty string.) To construct a structure from this grammar, start with S, and replace it by one of the allowed replacements given by the right-hand sides (e.g., S → aSb). Now take the resulting structure and replace any nonterminal (here S) by one of its allowed replacements (e.g., aSb → aaSbb). Continue in this way until no nonterminals are left (e.g., aaSbb → aabb, using S → ε). It can easily be shown that the set of structures that can be produced by this grammar is exactly the strings aⁿbⁿ, consisting of the same number of a's and b's with all the a's on the left and all the b's on the right.
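A few lines of Python make the derivation process concrete. This is a sketch of the mechanism just described, not anything from Kitano's paper; the rule table and the random choice among replacements are assumptions for illustration.

```python
import random

# The example grammar: S rewrites to "aSb" or to the empty string.
RULES = {"S": ["aSb", ""]}

def derive(string="S", max_steps=20):
    # Repeatedly replace the leftmost nonterminal until none remain.
    for _ in range(max_steps):
        nonterminals = [c for c in string if c in RULES]
        if not nonterminals:
            break
        string = string.replace(nonterminals[0], random.choice(RULES[nonterminals[0]]), 1)
    return string

print(derive())  # e.g., "aabb": S -> aSb -> aaSbb -> aabb
```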

Kitano applied this general idea to the development of neural networks using a type of grammar called a "graph-generation grammar," a simple example of which is given in figure 2.22a. Here the right-hand side of each rule is a 2 × 2 matrix rather than a one-dimensional string. Capital letters are nonterminals, and lower-case letters are terminals. Each lower-case letter from a through p represents one of the 16 possible 2 × 2 arrays of ones and zeros. In contrast to the grammar for aⁿbⁿ given above, each nonterminal in this particular grammar has exactly one right-hand side, so there is only one structure that can be formed from this grammar: the 8 × 8 matrix shown in figure 2.22b. This matrix can be interpreted as a connection matrix for a neural network.
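The development step itself can be sketched as recursive matrix expansion: the start symbol S rewrites to a 2 × 2 matrix of nonterminals, each of those rewrites to a 2 × 2 matrix of terminal letters, and each letter stands for a 2 × 2 array of bits, yielding an 8 × 8 connection matrix after three levels. The Python below follows that mechanism, but since figure 2.22a is not reproduced here, the particular rule table and the assignment of letters to bit arrays are invented for illustration.

```python
import numpy as np

# Hypothetical graph-generation grammar in the style described in the text;
# these particular rules are assumptions, not Kitano's actual example.
NONTERMINAL_RULES = {
    "S": [["A", "B"], ["C", "D"]],
    "A": [["p", "a"], ["a", "a"]],
    "B": [["a", "a"], ["a", "p"]],
    "C": [["a", "a"], ["p", "a"]],
    "D": [["a", "p"], ["a", "a"]],
}
# Two of the 16 terminal letters a-p, each naming a 2 x 2 array of ones and
# zeros; which letter names which array is likewise assumed here.
TERMINAL_ARRAYS = {
    "a": np.zeros((2, 2), dtype=int),
    "p": np.ones((2, 2), dtype=int),
}

def develop(symbol):
    # Recursively expand a symbol: terminals become 2 x 2 bit arrays, and
    # nonterminals become the block matrix built from their right-hand side.
    if symbol in TERMINAL_ARRAYS:
        return TERMINAL_ARRAYS[symbol]
    rhs = NONTERMINAL_RULES[symbol]
    return np.vstack([np.hstack([develop(s) for s in row]) for row in rhs])

connection_matrix = develop("S")
print(connection_matrix.shape)  # (8, 8): the start symbol after three expansions
```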

