23.02.2015 Views

Machine Learning - DISCo

Machine Learning - DISCo

Machine Learning - DISCo

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

explaining to itself the reason for this conflict and creating a rule such as the one<br />

above. The net effect is that PRODIGY uses domain-independent knowledge about<br />

possible subgoal conflicts, together with domain-specific knowledge of specific<br />

operators (e.g., the fact that the robot can pick up only one block at a time), to<br />

learn useful domain-specific planning rules such as the one illustrated above.<br />

The use of explanation-based learning to acquire control knowledge for<br />

PRODIGY has been demonstrated in a variety of problem domains including the<br />

simple block-stacking problem above, as well as more complex scheduling and<br />

planning problems. Minton (1988) reports experiments in three problem domains,<br />

in which the learned control rules improve problem-solving efficiency by a factor<br />

of two to four. Furthermore, the performance of these learned rules is comparable<br />

to that of handwritten rules across these three problem domains. Minton also describes<br />

a number of extensions to the basic explanation-based learning procedure<br />

that improve its effectiveness for learning control knowledge. These include methods<br />

for simplifying learned rules and for removing learned rules whose benefits<br />

are smaller than their cost.<br />

A second example of a general problem-solving architecture that incorporates<br />

a form of explanation-based learning is the SOAR system (Laird et al. 1986;<br />

Newel1 1990). SOAR supports a broad variety of problem-solving strategies that<br />

subsumes PRODIGY'S means-ends planning strategy. Like PRODIGY, however, SOAR<br />

learns by explaining situations in which its current search strategy leads to inefficiencies.<br />

When it encounters a search choice for which it does not have a definite<br />

answer (e.g., which operator to apply next) SOAR reflects on this search impasse,<br />

using weak methods such as generate-and-test to determine the correct course of<br />

action. The reasoning used to resolve this impasse can be interpreted as an explanation<br />

for how to resolve similar impasses in the future. SOAR uses a variant of<br />

explanation-based learning called chunking to extract the general conditions under<br />

which the same explanation applies. SOAR has been applied in a great number<br />

of problem domains and has also been proposed as a psychologically plausible<br />

model of human learning processes (see Newel1 1990).<br />

PRODIGY and SOAR demonstrate that explanation-based learning methods can<br />

be successfully applied to acquire search control knowledge in a variety of problem<br />

domains. Nevertheless, many or most heuristic search programs still use numerical<br />

evaluation functions similar to the one described in Chapter 1, rather than rules<br />

acquired by explanation-based learning. What is the reason for this? In fact, there<br />

are significant practical problems with applying EBL to learning search control.<br />

First, in many cases the number of control rules that must be learned is very large<br />

(e.g., many thousands of rules). As the system learns more and more control rules<br />

to improve its search, it must pay a larger and larger cost at each step to match this<br />

set of rules against the current search state. Note this problem is not specific to<br />

explanation-based learning; it will occur for any system that represents its learned<br />

knowledge by a growing set of rules. Efficient algorithms for matching rules can<br />

alleviate this problem, but not eliminate it completely. Minton (1988) discusses<br />

strategies for empirically estimating the computational cost and benefit of each<br />

rule, learning rules only when the estimated benefits outweigh the estimated costs

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!