Evolutionary Computation: A Unified Approach



of models of sufficient descriptive power are generally so large as to prohibit any form of systematic search. However, as we discussed in the previous section, EAs can often be used to search large complex spaces effectively, including the model spaces of interest here.

Consequently, ML-EAs tend to adopt a more top-down approach of searching a model space by means of a population of models that compete with each other, not unlike scientific theories in the natural sciences. In order for this to be effective, a notion of model “fitness” must be provided in order to bias the search process in a useful way. The first thought one might have is to define the fitness of a model in terms of its observed performance on the provided training examples. However, just as we see in other approaches, ML-EAs with no other feedback or bias will tend to overfit the training data at the expense of generality. So, for example, if an ML-EA is attempting to learn a set of classification rules using only training set performance as the measure of fitness, it is not at all unusual to see near-perfect rule sets emerge that consist of approximately one rule per training example!

Stated another way, for a given set of training examples, there are a large number of theories (models) that have identical performance on the training data, but can have quite different predictive power. Since, by definition, we cannot learn from unseen examples, this generality must be achieved by other means. A standard technique for doing so is to adopt some sort of “Occam’s razor” approach: all other things being equal, select a simpler model over a more complex one.

For ML-EAs this is typically achieved by augmenting the fitness function to include both performance on the training data and the parsimony of the model. Precise measurements of parsimony can be quite difficult to define in general. However, rough estimates based on the size of a model measured in terms of its basic building blocks have been shown to be surprisingly effective. For example, using the number of rules in a rule set or the number of hidden nodes in an artificial neural network as an estimate of parsimony works quite well.

Somewhat more difficult from an ML-EA designer’s point of view is finding an appropriate balance between parsimony and performance. If one puts too much weight on generality, performance will suffer, and vice versa. As we will see in a later section in this chapter, how best to apply EAs to problems involving multiple conflicting objectives is a challenging problem in general. To keep things simple, most ML-EAs adopt a fairly direct approach such as:

fitness(model) = performance(model) − w ∗ parsimony(model)

where the weight w is empirically chosen to discourage overfitting, and parsimony is a simple linear or quadratic function of model size (Smith, 1983; Bassett and De Jong, 2000).
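As a concrete illustration, here is a minimal sketch (in Python) of how such a parsimony-penalized fitness might be computed for an evolved rule-set classifier. The model interface (a classify method and a list of rules), the default weight, and the choice of a simple linear penalty are illustrative assumptions, not details prescribed by the text.

```python
# Minimal sketch of a parsimony-penalized fitness for an ML-EA that
# evolves rule-set classifiers. The model interface (classify(), rules),
# the default weight w, and the linear penalty on rule count are
# assumptions made for illustration only.

def fitness(model, training_examples, w=0.01):
    """fitness(model) = performance(model) - w * parsimony(model)."""
    # Performance: fraction of training examples classified correctly.
    correct = sum(
        1 for features, label in training_examples
        if model.classify(features) == label
    )
    performance = correct / len(training_examples)

    # Parsimony: model size in basic building blocks (here, rule count).
    parsimony = len(model.rules)

    return performance - w * parsimony
```

Note that with w = 0 this reduces to raw training-set accuracy, which rewards the degenerate one-rule-per-example solutions described above; even a small positive w breaks such ties in favor of smaller rule sets with equal training performance.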

Just as was the case in the previous sections, applying EAs to machine-learning problems is not some sort of magic wand that renders existing ML techniques obsolete. Rather, ML-EAs complement existing approaches in a number of useful ways:

• Many of the existing ML techniques are designed to learn “one-shot” classification tasks, in which problems are presented as precomputed feature vectors to be classified as belonging to a fixed set of categories. When ML-EAs are applied to such problems, they tend to converge more slowly than traditional ML techniques but often to more parsimonious models (De Jong, 1988).
