A Framework for Evaluating Early-Stage Human - of Marcus Hutter

More documents

Recommendations

Info

Anderson & Lebiere, 2003). A key common aspect is to enforce generality in approach, in order to prevent specialpurpose optimization to narrow tasks and force integration of capabilities. One can view that strategy as effectively overwhelming the degrees of freedom in the architecture with converging constraints in the data. However, precise computational specifications of those tests have to tread a tight rope between requiring unreasonable amounts of effort in modeling broad and complex tasks and falling back into narrow task specifications that will again favor engineered, optimized approaches. This model competition is our attempt at testing general cognitive capabilities in an open-ended task while offering low barriers to entry. We see model comparison as one solution to these problems. Model comparison is not just a practical solution for understanding how different systems work, but a theoretical prescription for identifying invariant structures among different approaches, seeing how they are, in fact, applied to specific problems and a way of seeing past the buzzwords to the mechanism that might finally illuminate what is needed to realize an artificial general intelligence. References Anderson, J. R. & Lebiere, C. L. 2003. The Newell test for a theory of cognition. Behavioral & Brain Sciences 26, 587-637. Asada, M., Kitano, H., Noda, I., and Veloso, M. 1999. RoboCup: Today and tomorrow – What we have have learned. Artificial Intelligence, 110:193–214. Brehmer, B., & Allard, R. 1991. Dynamic decisionmaking: The effects of task complexity and feedback delay. In J. Rasmussen, B. Brehmer & J. Leplat (Eds.), Distributed decision making: Cognitive models of cooperative work (pp. 319-334). Chichester: Wiley. Billings, D. 2000. The First International RoShamBo Programming Competition. ICGA Journal, Vol. 23, No. 1, pp. 42-50. Cohen, P. 2005. If Not Turing’s Test, Then What? AI Magazine 26(4): 61–67. Cronin, M., & Gonzalez, C. 2007. Understanding the building blocks of dynamic systems. System Dynamics Review. 23(1), 1-17. Cronin, M., Gonzalez, C., & Sterman, J. D. 2008. Why don't well-educated adults understand accumulation? A challenge to researchers, educators and citizens. In press. Organizational Behavior and Human Decision Processes. Diehl, E., & Sterman, J. D. 1995. Effects of feedback complexity on dynamic decision-making. Organizational Behavior and Human Decision Processes, 62(2), 198-215. Dutt, V. & Gonzalez, C. 2008. Human Perceptions of Climate Change. The 26 th International Conference of the System Dynamics Society. (pp. ). Athens, Greece: System Dynamics Society. 107 Dutt, V. & Gonzalez, C. 2007. Slope of Inflow Impacts Dynamic Decision Making. The 25 th International Conference of the System Dynamics Society. (pp. 79). Boston, MA: System Dynamics Society. Erev, I., Ert, E., Roth, A. E., Haruvy, E., Herzog, S., Hau, R., Hertwig, R., Stewart, T., West, R., & Lebiere, C. (submitted). A choice prediction competition, for choices from experience and from description. Journal of Behavioral Decision Making. Foyle, D. & Hooey, B. 2008. Human Performance Modeling in Aviation. Mahwah, NJ: Erlbaum. Gluck, K, & Pew, R. 2005. Modeling Human Behavior with Integrated Cognitive Architectures. Mahwah, NJ: Erlbaum. Gonzalez, C., & Dutt, V. 2007. Learning to control a dynamic task: A system dynamics cognitive model of the slope effect. In Proceedings of the 8th International Conference on Cognitive Modeling, Ann Arbor, MI. Hsu, F-H. 2002. Behind Deep Blue: Building the Computer that Defeated the World Chess Champion. Princeton University Press. Lebiere, C., & Bothell, D. 2004. Competitive Modeling Symposium: PokerBot World Series. In Proceedings of the Sixth International Conference on Cognitive Modeling, Pp. 32-32. Lebiere, C., Gonzalez, C., & Warwick, W. (submitted). Emergent Complexity in a Dynamic Control Task: Model Comparison. Newell, A. 1990. Unified Theories of Cognition. Harvard University Press. Roberts, S. and Pashler, H. 2000. “How Persuasive Is a Good Fit? A Comment on Theory testing.” Psychological Review 107(2): pp358-367. Schunn, C. D. & Wallach, D. 2001. Evaluating goodnessof-fit in comparisons of models to data. Online manuscript. http://lrdc.pitt.edu/schunn/gof/index.html Selman, B., Brooks, R., Dean, T., Horvitz, E., Mitchell, T., & Nilsson, N. 1996. Challenge problems for artificial intelligence. In Proceedings of the 13th Natl. Conf. on Artificial Intelligence (AAAI-96), Portland, OR, pp. 193- 224. Thrun, S. et al. 2006. Stanley, the robot that won the DARPA Grand Challenge. Journal of Field Robotics, 23(9), 661–692. Turing, A. 1950. Computing Machinery and Intelligence, Mind LIX (236): 433–460. Warwick, W., Allender, L., Strater, L., & Yen, J. 2008. AMBR Redux: Another Take on Model Comparison. Symposium given at the Seventeenth Conference on Behavior Representation and Simulation. Providence, RI.
Incorporating Planning and Reasoning into a Self-Motivated, Communicative Agent Abstract Most work on self-motivated agents has focused on acquiring utility-optimizing mappings from states to actions. But such mappings do not allow for explicit, reasoned anticipation and planned achievement of future states and rewards, based on symbolic knowledge about the environment and about the consequences of the agent’s own behavior. In essence, such agents can only behave reflexively, rather than reflectively. Conversely, planning and reasoning have been viewed within AI as geared towards satisfaction of explicitly specified user goals, without consideration of the long-range utility of the planner/reasoner’s choices. We take a step here towards endowing a self-motivated, utility-optimizing agent with reasoning and planning abilities, and show that such an agent benefits both from its knowledge about itself and its environment, and from exploiting opportunities as it goes. Our simulated simple agent can cope with unanticipated environmental events and can communicate with the user or with other agents. Introduction There is rather broad agreement in AI that general human-like intelligent behavior, apart from lower-level activities like perception and reflexes, is guided by planning and reasoning. However, planning and reasoning have traditionally been understood as aimed at the fulfillment of specified user goals, rather than as internally motivated by consideration of the long-range utility of the planner/reasoner’s choices. Conversely, research on self-motivated agents has focused almost exclusively on acquiring utility-optimizing mappings from states to actions, without reasoned anticipation and planned attainment of future states and rewards based on symbolic knowledge about the environment and consequences of the agent’s own behavior. These observations have motivated our work on explicit self-awareness (Sch05) and more thoughtful self-motivation in agents. Here we present a self-aware and self-motivated agent that thinks ahead, plans and reasons deliberately, and acts reflectively, both by drawing on knowledge about itself and its environment and by seizing opportunities Copyright c○ 2009, The Second Conference on Artificial General Intelligence (AGI-09.org). All rights reserved. Daphne Liu and Lenhart Schubert Department of Computer Science University of Rochester Rochester, NY USA 108 to optimize its utility. As an initial step towards a full conversation agent, our simple agent can communicate with the user or with other agents in addition to coping with unforeseen environmental events while acting opportunistically. In the following sections, we discuss the notions of explicit self-awareness (Sch05) and thoughtful selfmotivation, as well as how they are realized in our system. Then we outline our system and describe how our agent benefits from self-awareness, thoughtful selfmotivation, and opportunistic choices. We conclude with a summary and a discussion of future work. Explicit Self-Awareness Explicit self-awareness was characterized by Schubert (Sch05) as being both human-like and explicit. Specifically, it is human-like in that an explicitly self-aware agent must have a well-elaborated human-like model of the world, including a model of itself and its relationships to the world. The self-model encompasses its beliefs, desires, intentions, knowledge, abilities, autobiography, the current situation, etc.; in addition, the agent must be capable of goal- and utility-directed reasoning and planning. In addition to prescribing the aforementioned humanlike capabilities, explicit self-awareness is explicit in three respects. First, an agent’s representation of selfknowledge must be amenable to self-observation and use by the agent (and for engineering reasons, browsable and comprehensible to the designer). Second, explicit self-awareness must be conveyable by the agent, through language or other modalities. Third, the agent’s self-knowledge must be amenable to inferences in conjunction with world knowledge. Schubert (Sch05) enumerated reasons motivating the need for explicit self-awareness. First, given its bootstrapping potential with respect to meta-control, error recovery, autoepistemic reasoning, and learning of skills or facts, explicit self-awareness would help expand the frontiers of AI. Moreover, an explicitly self-aware agent can interact with human users in a transparent, natural, and engaging manner by having a shared context. Lastly, operational, explicitly self-aware agents can serve as exemplars of entities whose internal ba-
Page 2 and 3:
Ben Goertzel, Pascal Hitzler, Marcu
Page 4 and 5:
Artificial General Intelligence Vol
Page 6:
Preface Artificial General Intellig
Page 9 and 10:
Organizing Committee Tsvi Achler U.
Page 11 and 12:
A formal framework for the symbol g
Page 13 and 14:
Importing Space-time Concepts Into
Page 15 and 16:
one structure, everything else adap
Page 17 and 18:
generalize is essential to understa
Page 19 and 20:
First Application The above framewo
Page 21 and 22:
problem solving, use of knowledge,
Page 23 and 24:
Of particular note is the separatio
Page 25 and 26:
Conclusions and Future Work While p
Page 27 and 28:
Conceptual Spaces The conceptual ar
Page 29 and 30:
perceptions and actions. The lingui
Page 31 and 32:
means of the knowledge stored in th
Page 33 and 34:
grams given some (syntactically res
Page 35 and 36:
For example, let reverse([]) = [] r
Page 37 and 38:
Igor2 produced solutions with auxil
Page 39 and 40:
evolve neural net modules as quickl
Page 41 and 42:
ain”, consisting of some dozen or
Page 43 and 44:
Population Chr NListPtr m Chrm is a
Page 45 and 46:
and shaping, we feel that exploring
Page 47 and 48:
Table 2 reviews the key capabilitie
Page 49 and 50:
One way to achieve this goal would
Page 51 and 52:
question. Our approach of staying a
Page 53 and 54:
H0 δB(H0) δF(H0) H1 δB(H1) δF(H
Page 55 and 56:
Discussion and future work Testing
Page 57 and 58:
Types of Reasoning Corresponding Fo
Page 59 and 60:
Integrating Different Types of Reas
Page 61 and 62:
good overview of analogy models can
Page 63 and 64:
��
Page 65 and 66:
� ��
Page 67 and 68:
��
Page 69 and 70: A Unified Framework for IP Conditio
Page 71 and 72: negative evidence when added to a r
Page 73 and 74: ing strategy. Due to GOLEM’s rand
Page 75 and 76: any state index, n∈IN the current
Page 77 and 78: is the best observation summary, wh
Page 79 and 80: to a code for the rewards only, whi
Page 81 and 82: have exponentially many states (2O(
Page 83 and 84: where ˆσ 2 =Loss( ˆw)/n. Given
Page 85 and 86: • The cost function can be improv
Page 87 and 88: decides to introduce inflation or d
Page 89 and 90: € € The diffusion matrix is the
Page 91 and 92: tuning may be done adaptively by te
Page 93 and 94: Of course, Harnad’s argument is n
Page 95 and 96: By exploring variations of Harnad
Page 97 and 98: Discussion The representational sys
Page 99 and 100: the cognitive map is less of a map
Page 101 and 102: models of cognition except in cases
Page 103 and 104: 7(b), we can see that the straight
Page 105 and 106: eaction times, error rates, or exac
Page 107 and 108: comparing behavior with and without
Page 109 and 110: Doorenbos, R. B. 1994. Combining le
Page 111 and 112: egion, we leave the type of object
Page 113 and 114: extract features in support of reco
Page 115 and 116: with a minimum score of -1.0. The
Page 117 and 118: the NASA Human Error Modeling compa
Page 119: whether their models are capable of
Page 123 and 124: see an unclaimed ten-dollar bill al
Page 125 and 126: swering user queries) and the state
Page 127 and 128: Program Representation for General
Page 129 and 130: distribution P corresponding to the
Page 131 and 132: with all Ei replaced by the unbound
Page 133 and 134: Consciousness in Human and Machine:
Page 135 and 136: at the sensory receptors, is pre-pr
Page 137 and 138: The second choice is to take a deta
Page 139 and 140: Hebbian Constraint on the Resolutio
Page 141 and 142: The Case of Inputs that Change by T
Page 143 and 144: This route is relevant if the noise
Page 145 and 146: 1. Oculus Info. Inc. Toronto, Ont.
Page 147 and 148: Our implemented Turing judge used a
Page 149 and 150: 2.7; Human vs. JabberWacky t(157.6)
Page 151 and 152: UnsupervisedSegmentationofAudioSpee
Page 153 and 154: FinallyweusedthediscreteFourierTran
Page 155 and 156: Secondly,thereexistsmorethanonelogi
Page 157 and 158: Parsing PCFG within a General Proba
Page 159 and 160: known in advance, although many imp
Page 161 and 162: Figure 2: Shows the underlying weig
Page 163 and 164: Self-Programming: Operationalizing
Page 165 and 166: expressivity. More over, self-progr
Page 167 and 168: existence of a distance function ov
Page 169 and 170: Bootstrap Dialog: A Conversational
Page 171 and 172:
elation is typically a verb. In the
Page 173 and 174:
node RDF proposition N1 N2 N3 N4 N5
Page 175 and 176:
Analytical Inductive Programming as
Page 177 and 178:
Problem domain: puttable(x) PRE: cl
Page 179 and 180:
eq Hanoi(0, Src, Aux, Dst, S) = mov
Page 181 and 182:
Human and Machine Understanding Of
Page 183 and 184:
case orderings t correspond to the
Page 185 and 186:
in (1) received a different denotat
Page 187 and 188:
Abstract This paper analyzes the di
Page 189 and 190:
Having grounded meaning: In an inte
Page 191 and 192:
to experience” can be argued to b
Page 193 and 194:
Abstract Case-by-case Problem Solvi
Page 195 and 196:
First, since NARS is designed in th
Page 197 and 198:
typical response is to find such an
Page 199 and 200:
What Is Artificial General Intellig
Page 201 and 202:
Finally, the facts that both seem t
Page 203 and 204:
differ. NARS uses Narsese, the fair
Page 205 and 206:
Integrating Action and Reasoning th
Page 207 and 208:
states, with the condition that the
Page 209 and 210:
action model. The system is able to
Page 211 and 212:
Neuroscience and AI Share the Same
Page 213 and 214:
Relevance Based Planning: Why Its a
Page 215 and 216:
General Intelligence and Hypercompu
Page 217 and 218:
To appear, AGI-09 1 Stimulus proces
Page 219 and 220:
Distribution of Environments in For
Page 221 and 222:
The Importance of Being Neural-Symb
Page 223 and 224:
Improving the Believability of Non-
Page 225 and 226:
Understanding the Brain’s Emergen
Page 227 and 228:
Abstract The challenge of creating
Page 229 and 230:
Importing Space-time Concepts Into
Page 231 and 232:
HELEN: Using Brain Regions and Mech
Page 233 and 234:
Holistic Intelligence: Transversal
Page 235 and 236:
Achieving Artificial General Intell
Page 237:
Achler, Tsvi . . . . . . . . . . .
show all

A Framework for Evaluating Early-Stage Human - of Marcus Hutter

Create successful ePaper yourself

Delete template?

Save as template?