A Framework for Evaluating Early-Stage Human - of Marcus Hutter

More documents

Recommendations

Info

Extending Cognitive Architectures with Mental Imagery Abstract Inspired by mental imagery, we present results of extending a symbolic cognitive architecture (Soar) with general computational mechanisms to support reasoning with symbolic, quantitative spatial, and visual depictive representations. Our primary goal is to achieve new capabilities by combining and manipulating these representations using specialized processing units specific to a modality but independent of task knowledge. This paper describes the architecture supporting behavior in an environment where perceptual-based thought is inherent to problem solving. Our results show that imagery provides the agent with additional functional capabilities improving its ability to solve rich spatial and visual problems. Introduction The generality and compositional power of sentential, symbolic processing has made it central to reasoning in general AI systems. However, these general symbolic systems have failed to address and account for inherently perceptual, modality-specific processing that some argue should participate directly in thinking rather than serve exclusively as a source of information (Barsalou 1999; Chandrasekaran 2006). Mental imagery is an example of such thought processing. In this paper, we argue that general, intelligent systems require mechanisms to compose and manipulate amodal, symbolic and modality-specific representations. We defend our argument by presenting a synthesis of cognition and mental imagery constrained by a cognitive architecture, Soar (Laird 2008). Empirical results strengthen our claim by demonstrating how specialized, architectural components processing these representations can provide an agent with additional reasoning capability in spatial and visual tasks. Related Work One of the key findings in mental imagery experiments is that humans imagine objects using multiple representations and mechanisms associated with perception (Kosslyn, et al., 2006). For spatial and visual imagery, we assume there are at least three distinct representations: (1) amodal symbolic, (2) quantitative spatial, and (3) visual depictive. General reasoning with each of these representations is a key Scott D. Lathrop John E. Laird United States Military Academy University of Michigan D/EECS 2260 Hayward West Point, NY 10996 Ann Arbor, MI 48109-2121 scott.lathrop@usma.edu laird@umich.edu 97 distinction between this work and others. The history of using these representations in AI systems begins perhaps with Gelernter’s (1959) geometry theorem prover and Funt’s (1976) WHISPER system that reasoned with quantitative and depictive representations respectively. Some researchers, to include Glasgow and Papadias (1992) and Barkowsky (2002), incorporated mental imagery constraints in the design of their specific applications. The CaMeRa model of Tabachneck-Schijf’s et al. (1997) is perhaps the closest system related to this work. CaMeRa uses symbolic, quantitative, and depictive representations and includes visual short-term and long-term memories. Whereas their shape representation is limited to algebraic shapes (i.e. points and lines), we leave the type of object open-ended. CaMeRa’s spatial memory is limited to an object’s location while ignoring orientation, size, and hierarchical composition (e.g. a car is composed of a frame, four wheels, etc.). Our system uses these spatial properties, providing significantly more reasoning capability. Cognitive architectures have traditionally ignored modality specific representations. ACT-R’s (Anderson, 2007) perceptual and motor systems focus on timing predictions and resource constraints rather than their reuse for reasoning. Some researchers have extended the perception and motor capabilities of cognitive architectures (e.g. see Best et al., 2002; Wray et al., 2005). Each contribution effectively pushes the system closer to the environment but requires ad-hoc, bolted-on components tailored for specific domains. These approaches assume that cognition abandons perceptual mechanisms after input rather than using these mechanisms for problem solving. Kurup and Chandrasekaran (2007) argue for general, multi-modal architectures and augment Soar with diagrammatic reasoning. They are non-committal as to whether diagrams are quantitative or depictive. Their current implementation uses strictly quantitative structures. Key differences include their proposal for a single, working memory containing both symbolic and diagrammatic representations. We propose separate symbolic and representation-specific short-term memories where perceptual representations are not directly accessible to the symbolic system. Whereas their diagrammatic system constrains the specific types of objects to a point, curve, or
egion, we leave the type of object open-ended to any shape the agent experiences in the world or imagines by composing known objects. Wintermute and Laird (2008) extend Soar with a spatial reasoning system that focuses on translating qualitative predicates into quantitative representations and simulating continuous motion—extending the framework described here as it relates to spatial imagery. Gunzelmann and Lyon (2007) propose extending ACT-R with specialized, spatial processing that includes quantitative information. They do not plan to incorporate depictive representations without compelling evidence for their use. We hope to provide some evidence and argue that all three representations are necessary to achieve general functionality. Experimental Environment In previous work, Lathrop and Laird (2007) demonstrated how extending Soar with quantitative spatial and visual depictive representations provided an agent with capabilities for recognizing implicit spatial and visual properties. However, the results were limited to solving internally represented problems. This paper extends those results to a dynamic environment where the agent must interpret and act on information from multiple internal and external sources. The U.S. Army’s work in developing robotic scouts for reconnaissance missions (Jaczkowski, 2002) motivates the evaluation environment. In support of this effort, we built a simulation modeling a section of two robotic scout vehicles that must cooperate to maintain visual observation with an approaching enemy (Figure1a). One scout, the section lead, is a Soar agent, modeled with and without mental imagery for evaluation purposes. The other, teammate, scout is scripted. The section’s primary goal is to keep their commander informed of the enemy’s movement by periodically sending observation reports (through the lead) of the enemy’s location and orientation. The agent cannot observe its teammate because of terrain occlusions. However, the teammate periodically sends messages regarding its position. The teammate continuously scans the area to its front (Figure 1b) and sends reports to the agent when it observes the enemy. The teammate can reorient its view in response to orders from the agent. The agent can look at the environment (Figure 1c) or its map (Figure 1d). To motivate the reasoning capabilities when using multiple representations, consider how the agent makes decisions in this domain. Typically, a scout follows these steps after initial visual contact: (1) Deploy and report, (2) analyze the situation, and (3) choose and execute a course of action (U.S. Army 2002). Analysis involves reasoning about friendly and enemy locations and orientations, terrain, and obstacles. If the scout leader does not know the locations of all expected enemy, then he might hypothesize where other enemy entities are (Figure 1d). Note that the hypothesized enemy in Figure 1d is not the same as the actual situation in Figure 1a but rather an estimate based on the agent’s knowledge of how the enemy typically fights. 98 Analysis involves visualizing the situation and mentally simulating alternatives. Using spatial imagery, an agent can imagine each observed entity’s map icon on its external map. If the agent is confident in the information, it can “write” it on the external map, in effect making it persist. As information changes the agent updates the map, keeping its perceived image of the situation up to date. Using the external map as perceptual background, the agent can then imagine key terrain (enemy goals), possible enemy paths, its viewpoint, and its teammate’s viewpoint. Using visual imagery to take advantage of explicit space representation in a depiction, the agent can imagine what portion of those viewpoints cover the possible enemy paths and then imagine alternative courses of action by simulating different viewpoints. Based on the analysis the agent decides if it should reorient itself, its teammate, or both. In summary, decision-making proceeds by combining perceptual representations with task specific knowledge to construct an imagined scene. Analysis emerges through the manipulation of symbolic, quantitative, and depictive representations. Retrieval of the resulting representations provides new information to the agent that it uses to reason and produce action in the environment. (a) Actual situation (b) Teammate’s view (c) Agent’s view (d) Agent’s perceived map/imagined situation Figure 1: Experimental Environment Architecture Soar and its Spatial-Visual Imagery (Soar+SVI) module are the two major components in our system (Figure 2). Soar encompasses the symbolic representation. SVI includes the quantitative and depictive representations. It encapsulates high-level visual perception and mental imagery processing. Soar’s symbolic memories include a declarative, shortterm memory (STM) and a procedural, long-term memory (LTM). The symbolic STM is a graph structure (Figure 2)
Page 2 and 3:
Ben Goertzel, Pascal Hitzler, Marcu
Page 4 and 5:
Artificial General Intelligence Vol
Page 6:
Preface Artificial General Intellig
Page 9 and 10:
Organizing Committee Tsvi Achler U.
Page 11 and 12:
A formal framework for the symbol g
Page 13 and 14:
Importing Space-time Concepts Into
Page 15 and 16:
one structure, everything else adap
Page 17 and 18:
generalize is essential to understa
Page 19 and 20:
First Application The above framewo
Page 21 and 22:
problem solving, use of knowledge,
Page 23 and 24:
Of particular note is the separatio
Page 25 and 26:
Conclusions and Future Work While p
Page 27 and 28:
Conceptual Spaces The conceptual ar
Page 29 and 30:
perceptions and actions. The lingui
Page 31 and 32:
means of the knowledge stored in th
Page 33 and 34:
grams given some (syntactically res
Page 35 and 36:
For example, let reverse([]) = [] r
Page 37 and 38:
Igor2 produced solutions with auxil
Page 39 and 40:
evolve neural net modules as quickl
Page 41 and 42:
ain”, consisting of some dozen or
Page 43 and 44:
Population Chr NListPtr m Chrm is a
Page 45 and 46:
and shaping, we feel that exploring
Page 47 and 48:
Table 2 reviews the key capabilitie
Page 49 and 50:
One way to achieve this goal would
Page 51 and 52:
question. Our approach of staying a
Page 53 and 54:
H0 δB(H0) δF(H0) H1 δB(H1) δF(H
Page 55 and 56:
Discussion and future work Testing
Page 57 and 58:
Types of Reasoning Corresponding Fo
Page 59 and 60: Integrating Different Types of Reas
Page 61 and 62: good overview of analogy models can
Page 63 and 64: ��
Page 65 and 66: � ��
Page 67 and 68: ��
Page 69 and 70: A Unified Framework for IP Conditio
Page 71 and 72: negative evidence when added to a r
Page 73 and 74: ing strategy. Due to GOLEM’s rand
Page 75 and 76: any state index, n∈IN the current
Page 77 and 78: is the best observation summary, wh
Page 79 and 80: to a code for the rewards only, whi
Page 81 and 82: have exponentially many states (2O(
Page 83 and 84: where ˆσ 2 =Loss( ˆw)/n. Given
Page 85 and 86: • The cost function can be improv
Page 87 and 88: decides to introduce inflation or d
Page 89 and 90: € € The diffusion matrix is the
Page 91 and 92: tuning may be done adaptively by te
Page 93 and 94: Of course, Harnad’s argument is n
Page 95 and 96: By exploring variations of Harnad
Page 97 and 98: Discussion The representational sys
Page 99 and 100: the cognitive map is less of a map
Page 101 and 102: models of cognition except in cases
Page 103 and 104: 7(b), we can see that the straight
Page 105 and 106: eaction times, error rates, or exac
Page 107 and 108: comparing behavior with and without
Page 109: Doorenbos, R. B. 1994. Combining le
Page 113 and 114: extract features in support of reco
Page 115 and 116: with a minimum score of -1.0. The
Page 117 and 118: the NASA Human Error Modeling compa
Page 119 and 120: whether their models are capable of
Page 121 and 122: Incorporating Planning and Reasonin
Page 123 and 124: see an unclaimed ten-dollar bill al
Page 125 and 126: swering user queries) and the state
Page 127 and 128: Program Representation for General
Page 129 and 130: distribution P corresponding to the
Page 131 and 132: with all Ei replaced by the unbound
Page 133 and 134: Consciousness in Human and Machine:
Page 135 and 136: at the sensory receptors, is pre-pr
Page 137 and 138: The second choice is to take a deta
Page 139 and 140: Hebbian Constraint on the Resolutio
Page 141 and 142: The Case of Inputs that Change by T
Page 143 and 144: This route is relevant if the noise
Page 145 and 146: 1. Oculus Info. Inc. Toronto, Ont.
Page 147 and 148: Our implemented Turing judge used a
Page 149 and 150: 2.7; Human vs. JabberWacky t(157.6)
Page 151 and 152: UnsupervisedSegmentationofAudioSpee
Page 153 and 154: FinallyweusedthediscreteFourierTran
Page 155 and 156: Secondly,thereexistsmorethanonelogi
Page 157 and 158: Parsing PCFG within a General Proba
Page 159 and 160: known in advance, although many imp
Page 161 and 162:
Figure 2: Shows the underlying weig
Page 163 and 164:
Self-Programming: Operationalizing
Page 165 and 166:
expressivity. More over, self-progr
Page 167 and 168:
existence of a distance function ov
Page 169 and 170:
Bootstrap Dialog: A Conversational
Page 171 and 172:
elation is typically a verb. In the
Page 173 and 174:
node RDF proposition N1 N2 N3 N4 N5
Page 175 and 176:
Analytical Inductive Programming as
Page 177 and 178:
Problem domain: puttable(x) PRE: cl
Page 179 and 180:
eq Hanoi(0, Src, Aux, Dst, S) = mov
Page 181 and 182:
Human and Machine Understanding Of
Page 183 and 184:
case orderings t correspond to the
Page 185 and 186:
in (1) received a different denotat
Page 187 and 188:
Abstract This paper analyzes the di
Page 189 and 190:
Having grounded meaning: In an inte
Page 191 and 192:
to experience” can be argued to b
Page 193 and 194:
Abstract Case-by-case Problem Solvi
Page 195 and 196:
First, since NARS is designed in th
Page 197 and 198:
typical response is to find such an
Page 199 and 200:
What Is Artificial General Intellig
Page 201 and 202:
Finally, the facts that both seem t
Page 203 and 204:
differ. NARS uses Narsese, the fair
Page 205 and 206:
Integrating Action and Reasoning th
Page 207 and 208:
states, with the condition that the
Page 209 and 210:
action model. The system is able to
Page 211 and 212:
Neuroscience and AI Share the Same
Page 213 and 214:
Relevance Based Planning: Why Its a
Page 215 and 216:
General Intelligence and Hypercompu
Page 217 and 218:
To appear, AGI-09 1 Stimulus proces
Page 219 and 220:
Distribution of Environments in For
Page 221 and 222:
The Importance of Being Neural-Symb
Page 223 and 224:
Improving the Believability of Non-
Page 225 and 226:
Understanding the Brain’s Emergen
Page 227 and 228:
Abstract The challenge of creating
Page 229 and 230:
Importing Space-time Concepts Into
Page 231 and 232:
HELEN: Using Brain Regions and Mech
Page 233 and 234:
Holistic Intelligence: Transversal
Page 235 and 236:
Achieving Artificial General Intell
Page 237:
Achler, Tsvi . . . . . . . . . . .
show all

A Framework for Evaluating Early-Stage Human - of Marcus Hutter

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?