A Framework for Evaluating Early-Stage Human - of Marcus Hutter
                 Symbolic Approaches           Subsymbolic Approaches
Methods          (Mostly) logical and/or       (Mostly) analytic
                 algebraic
Strengths        Productivity, Recursion       Robustness, Learning,
                 Principle, Compositionality   Parsimony, Adaptivity
Weaknesses       Consistency Constraints,      Opaqueness,
                 Lower Cognitive Abilities     Higher Cognitive Abilities
Applications     Reasoning, Problem            Learning, Motor Control,
                 Solving, Planning etc.        Vision etc.
CogSci Relation  Not Biologically Inspired     Biologically Inspired
Other Features   Crisp                         Fuzzy

Table 4: Differences between symbolic and subsymbolic theories
The three levels proposed above outline the different mechanisms that occur. Analogical learning in this sense is therefore a process in which different learning mechanisms interact and are iteratively repeated to refine and correct knowledge on the various abstraction levels.⁵
Logic as Lingua Franca for AGI Systems
In this section, we sketch ideas of how logic, as the underlying representation formalism, can be used for integrating subsymbolic devices into a logical reasoning system.
Building Neural-Symbolic Learning Systems
There is an obvious gap between symbolic and subsymbolic theories. As a matter of fact, there is not only a methodological difference between these approaches; furthermore, the strengths and weaknesses of the two paradigms are complementarily distributed. Table 4 lists some important distinctions between these two types of modeling options. In particular, for an AGI system intended to cover a large part of the breadth of cognitive abilities, a natural idea would be to bring these two research traditions together.
The research endeavor of “neural-symbolic integration” attempts to resolve this tension between symbolic and subsymbolic approaches.⁶ The idea is to transform a highly structured input (e.g. a logical theory) into a flat input appropriate for connectionist learning. It has been shown that learning theories of propositional logic with neural networks can be achieved using different frameworks, such as the “core method” or KBANNs (Hölldobler and Kalinke, 1994; Towell and Shavlik, 1994). In recent years, extensions to predicate logic were proposed (Bader, Hitzler, and Hölldobler, 2008).
⁵ This distinguishes HDTP from other learning approaches based on analogy. For example, the SME tradition (Falkenhainer, Forbus, and Gentner, 1989) focuses on alignment and transfer, whereas abstraction and the interaction of different mechanisms do not play a crucial role.

⁶ A good overview of neural-symbolic integration can be found in (Hammer and Hitzler, 2007).

We sketch some ideas for learning predicate logic with neural networks based on topos theory (Gust, Kühnberger, and Geibel, 2007). First, we introduce some concepts. A topos T is a category in which all finite diagrams have limits (and colimits), any two objects have an exponential object, and there exists a subobject classifier (Goldblatt, 1984). As a consequence, T has an initial and a terminal object, as well as finite products and coproducts. The prototypical example of a topos is the category SET. The objects of SET are sets, connected by set-theoretic functions (called arrows). A product a × b can be identified with the well-known Cartesian product of two sets a and b, and an exponent a^b with the set of functions f : b → a. The terminal object ! is the one-element set {0}, with the property that for every set a there is exactly one arrow from a into {0}. The truth value object Ω = {0, 1} and the subobject classifier true : ! → Ω, mapping 0 to 1, generalize characteristic functions and therefore interpretations of predicates.
We summarize how a topos can be used for neural learning of a logical theory T given in a first-order language L.
• T is translated into a variable-free representation in a topos T. The result is a representation of logical expressions by commuting diagrams, where objects are sets and arrows are set-theoretic functions (in case we work in SET). Logical terms can be interpreted as mappings from the terminal object into the universe U, and binary logical connectives as mappings Ω × Ω → Ω. Quantified formulas correspond to an operation mapping (complex) predicates to (complex) predicates.
• An algorithm generates equations of the form f ∘ g = h and inequations of the form f ≠ g, corresponding to equations and inequations of arrows in T, i.e. set-theoretic functions in SET.
• As representations for objects and arrows, it is possible to choose vectors of the vector space R^n. The resulting equations and inequations can be used to train a feedforward neural network by backpropagation.
• The network learns a representation of the objects and arrows of the underlying topos (which themselves represent symbols and expressions of L), such that the truth conditions of the underlying axioms of T are satisfied. In other words, the network learns a model of T.
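The steps above can be illustrated with a deliberately simplified numerical sketch. This is not the authors' implementation: instead of a feedforward network, each arrow is represented by a matrix over R^n, composition f ∘ g becomes a matrix product, and plain finite-difference gradient descent stands in for backpropagation; the arrow names, dimension, learning rate, and margin are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4  # dimension of the vector-space representation (an assumption)

# One matrix per arrow symbol; composition f.g becomes the product F @ G.
arrows = {name: rng.normal(size=(n, n)) for name in ("f", "g", "h")}

equations = [("f", "g", "h")]   # constraints of the form f.g = h
inequations = [("f", "g")]      # constraints of the form f != g
margin = 1.0                    # how far unequal arrows must stay apart

def loss(arrows):
    """Penalize violated equations and inequations of arrows."""
    total = 0.0
    for a, b, c in equations:   # squared error ||A @ B - C||^2
        total += np.sum((arrows[a] @ arrows[b] - arrows[c]) ** 2)
    for a, b in inequations:    # hinge: keep ||A - B||^2 above the margin
        total += max(0.0, margin - np.sum((arrows[a] - arrows[b]) ** 2))
    return total

def grad_step(arrows, lr=0.01, eps=1e-4):
    """Crude finite-difference gradient descent (stand-in for backprop)."""
    for M in arrows.values():
        G = np.zeros_like(M)
        for i in range(n):
            for j in range(n):
                M[i, j] += eps; hi = loss(arrows)
                M[i, j] -= 2 * eps; lo = loss(arrows)
                M[i, j] += eps  # restore the entry
                G[i, j] = (hi - lo) / (2 * eps)
        M -= lr * G

for _ in range(500):
    grad_step(arrows)
# After training, f.g = h holds approximately while f stays distinct from g,
# i.e. the matrices form a (numerical) model of the arrow constraints.
```

The design point the sketch shares with the topos-based approach is that logical structure enters the learner only through flat numerical constraints on representations, not through symbolic expressions.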
Although the approach still has many problems to solve, the very possibility of coding the axioms of a logical theory T such that a neural network can learn a model of T can be considered the missing link between symbolic and subsymbolic representations. Applied to AGI systems, this means that logic can play the role of a lingua franca for general intelligence without neglecting either of the two separate worlds of symbolic and subsymbolic computation. Rather, it is possible to integrate both worlds into one architecture. A proposal for such an architecture (I-Cog), integrating both devices – the analogy engine and the neural-symbolic integration device – can be found in (Kühnberger et al., 2008).
Related Work
Analogies have been playing an important role in cognitive science and cognitively inspired AI for a long time. Classical frameworks are, for example, the symbolic approach SME (Falkenhainer, Forbus, and Gentner, 1989), the connectionist system LISA (Hummel and Holyoak, 1997), or the hybrid approach AMBR (Kokinov and Petrov, 2001). A