A Framework for Evaluating Early-Stage Human - of Marcus Hutter

More documents

Recommendations

Info

as propagation and the use of domain independent problem structure textures to guide variable and value selection. The second design goal is to focus Soar’s internal reasoning on the selection of textures measures to facilitate the learning of domain independent problem solving rules ideally useful across different problem types. Architecturally (Figure 1) CHS-Soar is composed of three parts: [1] the Soar kernel; [2] Soar agent (production rules); and, [3] the external agent. A more detailed overview of the design of CHS-Soar is provided by Bittle and Fox [Bittle, Fox, 2008]. Soar Kernel The Soar kernel is invariant and encompasses a fixed set of computational mechanisms including: working memory (state representation); long term memory (repository for production rules); decision cycle (used to link working and long term memory); and, learning system. Soar Agent (Production Rules) Production rules provide the “programming language” for developers to create different Soar programs, called agents, and form the content of Soar’s Long Term Memory. Soar’s general problem-solving ability results from the fact that the same architecture (the Soar kernel) can be used to solve different problems as defined by different sets of production rules. Rules are used to; propose, select and apply operators. Figure 1: CHS-Soar Architecture Operators perform actions and are consequently the locus of decision making [Laird et. al, 1987]. CHS-Soar operators are focused on variable and value texture measure selection. CHS-Soar operators provide the framework to introduce the CHS problem solving cycle. CHS-Soar Problem Solving Cycle Soar’s three-phase decision cycle (proposal, decision, and apply) provides a suitable framework to introduce the CHS problem solving cycle summarized in Table 1. 9 Table 1: CHS Problem Solving Cycle Repeat 1 Conduct propagation (or backtrack) within state • calculate variable texture measures 2 Select variable (based on heuristics as suggested by textures measures) • calculate value texture measures 3 Select value (based on heuristics as suggested by textures measures) Until (all variables values instantiated & all constraints satisfied) As depicted in Figure 2, three Soar decision cycles are performed during one CHS cycle. Instrumental to the learning of domain independent problem-solving rules is the requirement to have the Soar agent component of the CHS cycle only reason about the selection of normalized variable and value textures measures — not actual problem variable and values. Consequently the Soar agent is decoupled from the actual problem constraint graph. The Soar agent interacts with the external agent via production action-side function calls. Figure 2: CHS-Soar Problem Solving Cycle External Agent The external agent is an optional user-defined program where Soar agent production rule functions reside. From the external agent we can register new functions which can extend the functionality of Soar agent productions. The CHS external agent provides five functions including: CSP problem representation (binary constraint graph); constraint propagation using the AC-3 algorithm [Mackworth, 1977]; chronological backtracking; and, variable and value texture calculations. Texture Measures Textures are structural measures of the constraint graph used to guide the order of variable and value selection [Fox et. al., 1989]. In order to demonstrate the future introduction of more insightful texture measures [Fox, 1990], three frequently cited [Kumar, 1992] variable and value ordering measures (and their associated heuristics) are utilized as outlined in Table 2. Table 2: Texture Measures and Associated Heuristics Name Texture Measures Heuristics Minimum Di, Number of remaining Select the variable Remaining values in domain. with the smallest Di, Value (MRV) value Degree Ci, number of constraints Select the variable (DEG) linked to variable. with the largest Ci value Least Fi, number of available Select the value Constraining values in domain of linked with the largest Fi Value (LCV) variables not instantiated. value
Of particular note is the separation of the texture measure from its associated unary heuristic. In order to facilitate learning across different problem sizes and more importantly different problem domains, texture measures are normalized between 0 (minimum value) and 1 (maximum value). A “pruned” (duplicate values removed) list of normalized texture measures are returned to the Soar agent. Pruning has the effect of further decoupling the texture measures from any specific problem domain. Soar and External Agent State Representation The Soar agent initially establishes a binary constraint graph in Soar’s working memory. Once problem solving begins, the Soar agent state representation is defined by the CHS phase (i.e. select variable), state status, desired state, and the collection of normalized texture measures. In contrast, the external agent maintains a complete binary constraint graph problem representation. Subgoaling and Learning Planning in Soar arises out of its impasse detection and substate creation mechanism. Soar automatically creates a subgoal whenever the preferences are insufficient for the decision procedure to select an operator [Laird, et. al, 1987]. Soar includes a set of default production rules that allow subgoaling to be performed using a simple lookahead search which CHS-Soar utilizes to evaluate variable and value texture measure performance. After propagation, the Soar agent has no prior knowledge of which variable and/or value texture measure to select (assuming no previous learning). Soar will detect an impasse and automatically subgoal to evaluate each texture measure returned and now cast as operators. Texture Measure Evaluation CHS-Soar uses three separate, yet related texture measure evaluations. The first is a simple numerical evaluation which gives a higher preference to the texture measure that requires fewer propagation cycles to achieve a candidate solution. Second, if a texture measure leads to a substate domain blow-out it is symbolically evaluated as a failure and rejected. Third, if we cannot match a texture measure in the substate (i.e. an updated constraint graph is generated that does not include the texture measure under evaluation) the texture measure is symbolically evaluated as a “partial failure.” Since CHS requires both variable and value selections to advance a solution, in order to facilitate the evaluation of any substate variable texture measure we are required to make some type of value texture measure commitment leading to a variable – value texture measure pairing. The approach utilized is to allow Soar to further subgoal (i.e. a sub-subgoal) about value texture selection based on the value texture measures generated for each variable texture measure under consideration. Within each sub-substate, 10 the variable — value pairing is held constant and a “candidate” solution is attempted. The second design goal of CHS-Soar is the learning of domain independent problem solving rules — specifically rules that can be transferred and effectively used on different problem types. Chunks are created in the CHS- Soar agent as we resolve impasses. Resulting chunks have as their conditions either unary or binary conditions composed of texture measures (cast as operators). Implementation of CHS-Soar The CHS-Soar agent was developed using Soar version 8.6.3 (Windows versions). The external agent was developed in C++ using Windows Visual Studio 2005. Experiments The paper reports on two selected computational experiments performed to-date. The objective of experiment 1 is to establish the subgoaling, learning and transfer of learning ability of CHS-Soar. Experiment 2 considered the impact of problem size/complexity on the performance of externally learned chunks. Problem instances were run 10 times and averaged. Experiments were conducted for two sample problems: [1] map colouring (MC); [2] job-shop scheduling (JSS), both well known combinatorial problems. The relationship of the map colouring problem and the more formal four-colour problem to general intelligence is highlighted by Swart [Swart, 1980] who notes one of the main attractions of the problem lies in the fact that it is “so simple to state that a child can understand it.” The JSS problem instance is taken from [Pang, et. al, 2000]. Results are presented in terms of “Decisions (cycles)” — the basic unit of reasoning effort in Soar (see Figure 2). Experiment 1: Learning and Transfer of Learning The first experiment investigated the impact of subgoaling, internal learning and the transfer of externally learned rules between problem types. In order to assess the performance of an external set of learned chunks on the map colouring problem, the chunks acquired through internal learning from the JSS problem were introduced for test case 4 (see Table 3) and vise versa for the JSS. Table 3: Learning and Transfer of Learning Test Cases Ref Test Case Description 1 Random Random selection of variable and value texture measures (indifferent). 2 Hard- Coded Explicitly coded MRV, DEG variable selection heuristics and LCV value selection heuristic. 3 Internal Problem is run using internally acquired chunks. 4 External Problem is run using externally acquired chunks from different problem domain.
Page 2 and 3: Ben Goertzel, Pascal Hitzler, Marcu
Page 4 and 5: Artificial General Intelligence Vol
Page 6: Preface Artificial General Intellig
Page 9 and 10: Organizing Committee Tsvi Achler U.
Page 11 and 12: A formal framework for the symbol g
Page 13 and 14: Importing Space-time Concepts Into
Page 15 and 16: one structure, everything else adap
Page 17 and 18: generalize is essential to understa
Page 19 and 20: First Application The above framewo
Page 21: problem solving, use of knowledge,
Page 25 and 26: Conclusions and Future Work While p
Page 27 and 28: Conceptual Spaces The conceptual ar
Page 29 and 30: perceptions and actions. The lingui
Page 31 and 32: means of the knowledge stored in th
Page 33 and 34: grams given some (syntactically res
Page 35 and 36: For example, let reverse([]) = [] r
Page 37 and 38: Igor2 produced solutions with auxil
Page 39 and 40: evolve neural net modules as quickl
Page 41 and 42: ain”, consisting of some dozen or
Page 43 and 44: Population Chr NListPtr m Chrm is a
Page 45 and 46: and shaping, we feel that exploring
Page 47 and 48: Table 2 reviews the key capabilitie
Page 49 and 50: One way to achieve this goal would
Page 51 and 52: question. Our approach of staying a
Page 53 and 54: H0 δB(H0) δF(H0) H1 δB(H1) δF(H
Page 55 and 56: Discussion and future work Testing
Page 57 and 58: Types of Reasoning Corresponding Fo
Page 59 and 60: Integrating Different Types of Reas
Page 61 and 62: good overview of analogy models can
Page 63 and 64: ��
Page 65 and 66: � ��
Page 67 and 68: ��
Page 69 and 70: A Unified Framework for IP Conditio
Page 71 and 72: negative evidence when added to a r
Page 73 and 74:
ing strategy. Due to GOLEM’s rand
Page 75 and 76:
any state index, n∈IN the current
Page 77 and 78:
is the best observation summary, wh
Page 79 and 80:
to a code for the rewards only, whi
Page 81 and 82:
have exponentially many states (2O(
Page 83 and 84:
where ˆσ 2 =Loss( ˆw)/n. Given
Page 85 and 86:
• The cost function can be improv
Page 87 and 88:
decides to introduce inflation or d
Page 89 and 90:
€ € The diffusion matrix is the
Page 91 and 92:
tuning may be done adaptively by te
Page 93 and 94:
Of course, Harnad’s argument is n
Page 95 and 96:
By exploring variations of Harnad
Page 97 and 98:
Discussion The representational sys
Page 99 and 100:
the cognitive map is less of a map
Page 101 and 102:
models of cognition except in cases
Page 103 and 104:
7(b), we can see that the straight
Page 105 and 106:
eaction times, error rates, or exac
Page 107 and 108:
comparing behavior with and without
Page 109 and 110:
Doorenbos, R. B. 1994. Combining le
Page 111 and 112:
egion, we leave the type of object
Page 113 and 114:
extract features in support of reco
Page 115 and 116:
with a minimum score of -1.0. The
Page 117 and 118:
the NASA Human Error Modeling compa
Page 119 and 120:
whether their models are capable of
Page 121 and 122:
Incorporating Planning and Reasonin
Page 123 and 124:
see an unclaimed ten-dollar bill al
Page 125 and 126:
swering user queries) and the state
Page 127 and 128:
Program Representation for General
Page 129 and 130:
distribution P corresponding to the
Page 131 and 132:
with all Ei replaced by the unbound
Page 133 and 134:
Consciousness in Human and Machine:
Page 135 and 136:
at the sensory receptors, is pre-pr
Page 137 and 138:
The second choice is to take a deta
Page 139 and 140:
Hebbian Constraint on the Resolutio
Page 141 and 142:
The Case of Inputs that Change by T
Page 143 and 144:
This route is relevant if the noise
Page 145 and 146:
1. Oculus Info. Inc. Toronto, Ont.
Page 147 and 148:
Our implemented Turing judge used a
Page 149 and 150:
2.7; Human vs. JabberWacky t(157.6)
Page 151 and 152:
UnsupervisedSegmentationofAudioSpee
Page 153 and 154:
FinallyweusedthediscreteFourierTran
Page 155 and 156:
Secondly,thereexistsmorethanonelogi
Page 157 and 158:
Parsing PCFG within a General Proba
Page 159 and 160:
known in advance, although many imp
Page 161 and 162:
Figure 2: Shows the underlying weig
Page 163 and 164:
Self-Programming: Operationalizing
Page 165 and 166:
expressivity. More over, self-progr
Page 167 and 168:
existence of a distance function ov
Page 169 and 170:
Bootstrap Dialog: A Conversational
Page 171 and 172:
elation is typically a verb. In the
Page 173 and 174:
node RDF proposition N1 N2 N3 N4 N5
Page 175 and 176:
Analytical Inductive Programming as
Page 177 and 178:
Problem domain: puttable(x) PRE: cl
Page 179 and 180:
eq Hanoi(0, Src, Aux, Dst, S) = mov
Page 181 and 182:
Human and Machine Understanding Of
Page 183 and 184:
case orderings t correspond to the
Page 185 and 186:
in (1) received a different denotat
Page 187 and 188:
Abstract This paper analyzes the di
Page 189 and 190:
Having grounded meaning: In an inte
Page 191 and 192:
to experience” can be argued to b
Page 193 and 194:
Abstract Case-by-case Problem Solvi
Page 195 and 196:
First, since NARS is designed in th
Page 197 and 198:
typical response is to find such an
Page 199 and 200:
What Is Artificial General Intellig
Page 201 and 202:
Finally, the facts that both seem t
Page 203 and 204:
differ. NARS uses Narsese, the fair
Page 205 and 206:
Integrating Action and Reasoning th
Page 207 and 208:
states, with the condition that the
Page 209 and 210:
action model. The system is able to
Page 211 and 212:
Neuroscience and AI Share the Same
Page 213 and 214:
Relevance Based Planning: Why Its a
Page 215 and 216:
General Intelligence and Hypercompu
Page 217 and 218:
To appear, AGI-09 1 Stimulus proces
Page 219 and 220:
Distribution of Environments in For
Page 221 and 222:
The Importance of Being Neural-Symb
Page 223 and 224:
Improving the Believability of Non-
Page 225 and 226:
Understanding the Brain’s Emergen
Page 227 and 228:
Abstract The challenge of creating
Page 229 and 230:
Importing Space-time Concepts Into
Page 231 and 232:
HELEN: Using Brain Regions and Mech
Page 233 and 234:
Holistic Intelligence: Transversal
Page 235 and 236:
Achieving Artificial General Intell
Page 237:
Achler, Tsvi . . . . . . . . . . .
show all

A Framework for Evaluating Early-Stage Human - of Marcus Hutter

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?