An Introduction to Genetic Algorithms - Boente

More documents

Recommendations

Info

the prospect of using GAs to automatically evolve computation in more complex systems. Moreover, evolving CAs with GAs also gives us a tractable framework in which to study the mechanisms by which an evolutionary process might create complex coordinated behavior in natural decentralized distributed systems. For example, by studying the GA's behavior, we have already learned how evolution's breaking of symmetries can lead to suboptimal computational strategies; eventually we may be able to use such computer models to test ways in which such symmetry breaking might occur in natural evolution. 2.2 DATA ANALYSIS AND PREDICTION A major impediment to scientific progress in many fields is the inability to make sense of the huge amounts of data that have been collected via experiment or computer simulation. In the fields of statistics and machine learning there have been major efforts to develop automatic methods for finding significant and interesting patterns in complex data, and for forecasting the future from such data; in general, however, the success of such efforts has been limited, and the automatic analysis of complex data remains an open problem. Data analysis and prediction can often be formulated as search problems—for example, a search for a model explaining the data, a search for prediction rules, or a search for a particular structure or scenario well predicted by the data. In this section I describe two projects in which a genetic algorithm is used to solve such search problems —one of predicting dynamical systems, and the other of predicting the structure of proteins. Predicting Dynamical Systems Chapter 2: Genetic Algorithms in Problem Solving Norman Packard (1990) has developed a form of the GA to address this problem and has applied his method to several data analysis and prediction problems. The general problem can be stated as follows: A series of observations from some process (e.g., a physical system or a formal dynamical system) take the form of a set of pairs, where are independent variables and y i is a dependent variable (1didN). For example, in a weather prediction task, the independent variables might be some set of features of today's weather (e.g., average humidity, average barometric pressure, low and high temperature, whether or not it rained), and the dependent variable might be a feature of tomorrow's weather (e.g., rain). In a stock market prediction task, the independent variables might be representing the values of the value of a particular stock (the "state variable") at successive time steps, and the dependent variable might be y=x(tn + k), representing the value of the stock at some time in the future. (In these examples there is only one dependent variable y for each vector of independent variables ; a more general form of the problem would allow a vector of dependent variables for each vector of independent variables.) Packard used a GA to search through the space of sets of conditions on the independent variables for those sets of conditions that give good predictions for the dependent variable. For example, in the stock market prediction task, an individual in the GA population might be a set of conditions such as 42
Chapter 2: Genetic Algorithms in Problem Solving where "^" is the logical operator "AND" This individual represents all the sets of three days in which the given conditions were met (possibly the empty set if the conditions are never met). Such a condition set C thus specifies a particular subset of the data points (here, the set of all 3−day periods). Packard's goal was to use a GA to search for condition sets that are good predictors of something—in other words, to search for condition sets that specify subsets of data points whose dependent−variable values are close to being uniform. In the stock market example, if the GA found Figure 2.11: Plot of a time series from Mackey−Glass equation with Ä = 150. Time is plotted on the horizontal axis; x(t)s> is plotted on the vertical axis. (Reprinted from Martin Casdagli and Stephen Eubank, eds., Nonlinear Modeling and Forecasting ; © 1992 Addison−Wesley Publishing Company, Inc. Reprinted by permission of the publisher.) a condition set such that all the days satisfying that set were followed by days on which the price of Xerox stock rose to approximately $30, then we might be confident to predict that, if those conditions were satisfied today, Xerox stock will go up. The fitness of each individual C is calculated by running all the data points ( y) in the training set through C and, for each that satisfies C, collecting the corresponding y. After this has been done, a measurement is made of the uniformity of the resulting values of y. If the y values are all close to a particular value Å, then C is a candidate for a good predictor for y—that is, one can hope that a new that satisfies C will also correspond to a y value close to Å. On the other hand, if the y values are very different from one another, then satisfying C does not seem to predict anything about the corresponding y value. As an illustration of this approach, I will describe the work done by Thomas Meyer and Norman Packard (1992) on finding "regions of predictability" in time series generated by the Mackey−Glass equation, a chaotic dynamical system created as a model for blood flow (Mackey and Glass 1977): Here x(t) is the state variable, t is time in seconds, and a, b, c, and Ä are constants. A time series from this system (with Ä set to 150) is plotted in figure 2.11. To form the data set, Meyer and Packard did the following: For each data point i, the independent variables are 50 consecutive values of x(t) (one per second): The dependent variable for data point i, y i , is the state variable t' time steps in the future: y i = x I 50 + t'. Each data point is formed by iterating the Mackey−Glass equation with a different initial condition, where an initial condition consists of values for {x1Ä,…, x 0}. Meyer and Packard used the following as a fitness function: 43
Page 2 and 3: An Introduction to Genetic Algorith
Page 4 and 5: Table of Contents Chapter 4: Theore
Page 6 and 7: Chapter 1: Genetic Algorithms: An O
Page 10 and 11: 1.4 SEARCH SPACES AND FITNESS LANDS
Page 12 and 13: potential energy is a measure of ho
Page 14 and 15: A simple method of implementing fit
Page 16 and 17: from an initial state to a goal. Fo
Page 18 and 19: Robert Axelrod of the University of
Page 26 and 27: instances of H at time t, and let
Page 28 and 29: What is the total payoff after 10 g
Page 30 and 31: 6. * c. d. a. b. Chapter 1: Genetic
Page 32 and 33: As a simple example, consider a pro
Page 34 and 35: Chapter 2: Genetic Algorithms in Pr
Page 36 and 37: it to get one fitness case correct:
Page 38 and 39: Evolving Cellular Automata Chapter
Page 42 and 43: up most of the computation time. Ch
Page 48 and 49: where Ã is the standard deviation
Page 54 and 55: the brain. In a feedforward network
Page 58 and 59: Figure 2.21: An illustration of Mil
Page 62 and 63: experiments with grammatical encodi
Page 64 and 65: function of the average error of th
Page 66 and 67: COMPUTER EXERCISES 1. 2. 3. 4. 5. 6
Page 70 and 71: simulated generations, and such sim
Page 72 and 73: environments, they can fairly quick
Page 74 and 75: Chapter 3: Genetic Algorithms in Sc
Page 76 and 77: Figure 3.6: A schematic illustratio
Page 82 and 83: an initial population in which the
Page 88 and 89: these random effects, Bedau and Pac
Page 90 and 91: 3. * 4. * the female child in the n
Page 92 and 93: calculating the observed average fi
Page 94 and 95: (4.1) The goal is to find n = n* th
Page 96 and 97:
competition in the d *···* parti
Page 98 and 99:
schemas with the best observed fitn
Page 100 and 101:
Steepest−ascent hill climbing (SA
Page 102 and 103:
(If the algorithm spends only 1/m o
Page 104 and 105:
strings—then in principle crossov
Page 106 and 107:
(4.7) Chapter 4: Theoretical Founda
Page 108 and 109:
2. 3. Calculate the fitness f(x) of
Page 110 and 111:
Defining ri,j(k) and is somewhat tr
Page 112 and 113:
Results of the Formalization How ca
Page 114 and 115:
Chapter 4: Theoretical Foundations
Page 116 and 117:
Page 118 and 119:
Figure 4.5: Predicted and observed
Page 120 and 121:
COMPUTER EXERCISES 1. 2. 3. 4. 5. 6
Page 122 and 123:
prone to rather arbitrary orderings
Page 124 and 125:
Inversion works by choosing two poi
Page 126 and 127:
Page 128 and 129:
The one hitch is that, as was seen
Page 130 and 131:
used in the work of Tanese (1989),
Page 132 and 133:
Two individuals are chosen at rando
Page 134 and 135:
Page 136 and 137:
strategies community, in which para
Page 138 and 139:
* 10. * 11. * 12. * Chapter 4: Theo
Page 140 and 141:
Chapter 6: Conclusions and Future D
Page 142 and 143:
Chapter 6: Conclusions and Future D
Page 144 and 145:
Appendix A: Selected General Refere
Page 146 and 147:
Evolution Artificielle Foundations
Page 148 and 149:
Kaufmann. Appendix B: Other Resourc
Page 150 and 151:
Morgan Kaufmann. Appendix B: Other
Page 152 and 153:
Appendix B: Other Resources Forrest
Page 154 and 155:
Appendix B: Other Resources Grefens
Page 156 and 157:
Kirkpatrick, S., Gelatt, C.D., Jr.,
Page 158 and 159:
Appendix B: Other Resources Mitchel
Page 160 and 161:
Appendix B: Other Resources Roughga
Page 162:
Appendix B: Other Resources Vose, M
show all

An Introduction to Genetic Algorithms - Boente

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?