
increases to infinity. It would make sense for different environments to have different time weight distributions.

Two points for consideration, despite their being discouraged by reviewers of this paper:

1. If PUTM programs were answers (as in Solomonoff Induction, where an agent seeks programs that match observed environment behavior), then weighting short programs more heavily would make sense, since shorter answers are better according to Occam's razor (see the length prior noted after this list). But here they are being used as questions, and longer programs pose more difficult questions, so arguably they should be weighted more heavily.

2. The physical universe contains no more than 10^90 bits (Lloyd 2002), and so is a finite state machine (FSM). Hence an intelligence measure based on FSMs is more realistic than one based on Turing machines.
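For reference, the length-based weighting questioned in point 1 is the usual prior from Solomonoff Induction (Li and Vitányi 1997): a prefix-free program p receives weight 2^(-|p|), and by the Kraft inequality ∑_p 2^(-|p|) ≤ 1, so short programs dominate the distribution.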

No Free Lunch and a Finite Model

The No-Free-Lunch Theorem (NFLT) tells us that all optimization algorithms have equal performance when averaged over all finite environments (Wolpert and Macready 1997). It is interesting to ask how this result relates to intelligence measures that average agent performance over environments.

To define an intelligence measure based on finite environments, take the sets A, O and R of actions, observations and rewards as finite and fixed. An environment is defined by an FSM:

f : S × A → S × O × R

where S is a finite set of states. The value of an agent in this environment is the expected value of a weighted sum over a finite sequence of future rewards, with the weights summing to 1. The measured intelligence of an agent is a weighted sum of its values in environments whose state set sizes fall in a finite range, again with weights summing to 1.
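As a minimal sketch of this finite model (the class and function names, and the uniform choice of time weights, are ours for illustration):

    class FSMEnvironment:
        # An environment f : S x A -> S x O x R stored as a transition
        # table: table[(state, action)] = (next_state, observation, reward).
        def __init__(self, table, initial_state):
            self.table = table
            self.state = initial_state

        def step(self, action):
            self.state, obs, reward = self.table[(self.state, action)]
            return obs, reward

    def agent_value(policy, env, horizon):
        # Weighted sum of future rewards; the model only requires the
        # time weights to sum to 1, and uniform weights 1/horizon are
        # one illustrative choice.
        value, history = 0.0, []
        for _ in range(horizon):
            action = policy(history)    # the policy sees the history so far
            obs, reward = env.step(action)
            history.append((action, obs, reward))
            value += reward / horizon
        return value

Measured intelligence is then a second weighted sum, with weights again summing to 1, of agent_value over environments whose state set sizes fall in a finite range.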

This finite model lacks an important hypothesis of the NFLT: that the optimization algorithm never takes the same action more than once. The same effect can be achieved by a no-repeating-state condition (NRSC) on environment FSMs: that they can never repeat the same state. Although this may seem artificial, it holds in the physical universe because of the second law of thermodynamics.

Assuming the NRSC, and that all FSMs with the same number of states share the same environment weight and the same sequence of time weights, all agents have the same measured intelligence: the average reward (∑_{r∈R} r) / |R| (Hibbard 2008).
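The following Monte Carlo sketch (ours, not the proof in Hibbard 2008) illustrates why: under the NRSC every step lands in a never-before-visited state, so sampling the uniformly weighted transition table lazily, entry by entry as the agent needs it, is equivalent to fixing the whole table in advance, and every reward the agent receives is a fresh uniform draw from R no matter what policy it follows.

    import random
    import statistics

    O = ['x', 'y']       # observations (illustrative)
    R = [0, 1, 4]        # rewards (illustrative)
    ACTIONS = [0, 1]
    HORIZON = 6          # uniform time weights 1/HORIZON sum to 1

    def measured_value(policy, trials=100_000):
        total = 0.0
        for _ in range(trials):
            history = []
            for _ in range(HORIZON):
                action = policy(history)
                # NRSC: the current state is new, so its table entry for
                # any action is an unseen, uniformly distributed draw (by
                # symmetry the identity of the next, new state is irrelevant).
                obs, reward = random.choice(O), random.choice(R)
                history.append((action, obs, reward))
                total += reward / HORIZON
        return total / trials

    def random_policy(history):
        return random.choice(ACTIONS)

    def greedy_policy(history):
        # Repeat the last action whenever it earned the top reward.
        if history and history[-1][2] == max(R):
            return history[-1][0]
        return random.choice(ACTIONS)

    print(measured_value(random_policy))   # both converge to the
    print(measured_value(greedy_policy))   # same measured value:
    print(statistics.mean(R))              # (0 + 1 + 4) / 3 = 5/3

Both policies, and indeed any policy, measure the same: the average reward over R.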

Conclusion

According to current physics the universe is an FSM satisfying the NRSC. If we measure agent intelligence using a distribution of FSMs satisfying the NRSC in which all FSMs with the same number of states have the same weight, then all agents have the same measured intelligence. In this environment distribution, the past behavior of environments provides no information about their future behavior. For a useful measure of intelligence, environments must be weighted to enable agents to predict the future from the past. This is the idea behind Kolmogorov complexity: to weight more heavily those environments that can be generated by short programs, since agents can more easily learn their behaviors.
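Schematically, this is the weighting in the measure of Legg and Hutter (2006), where the intelligence of an agent π sums its value V_μ(π) over environments μ, discounted by their Kolmogorov complexity K(μ):

Υ(π) = ∑_{μ∈E} 2^{-K(μ)} V_μ(π)

so that environments generated by short programs dominate the sum.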

However, Proposition 1 shows that a PUTM must be chosen carefully in an intelligence measure based on Kolmogorov complexity. This suggests a distribution of environments based on program length but less abstract than Kolmogorov complexity. So define a PUTM based on an ordinary programming language.
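As a rough sketch (ours) of such a distribution, environment programs written in an ordinary language can be weighted by their source length, in the spirit of the length prior above:

    def environment_weight(source: str) -> float:
        # Illustrative length-based weight for an environment program
        # in an ordinary language, at 8 bits per source character. A
        # real construction would need a prefix-free encoding of
        # programs so that the weights sum to at most 1.
        return 2.0 ** (-8 * len(source))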

States and behaviors never repeat in the physical world, but human agents learn to predict future behavior in the world by recognizing current behavior as similar to previously observed behaviors and making predictions based on those previous behaviors. Similarity can be recognized in sequences of values from unstructured sets such as {0, 1}, but there are more ways to recognize similarity in sequences of values from sets with metric and algebraic structures, such as numerical sets. Our physical world is described largely by numerical variables, and the best human efforts to predict behaviors in the physical world use numerical programming languages. So the sets A and O of actions and observations should be defined using numerical values, just as rewards are taken from a numerical set R. Including primitives for numerical operations in environment programs has the effect of skewing the distribution of environments toward similarity with the physical world.
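As a toy example (ours) of the kind of environment that numerical primitives make more probable, here observations follow a smooth numerical law and the reward grows as the agent's action approaches the next observation:

    import math

    def numerical_environment(step: int, action: float):
        # A toy environment program built from numerical primitives:
        # the observation is a smooth function of time, and the reward
        # rises as the agent's action approaches the observation.
        observation = math.sin(0.1 * step)
        reward = 1.0 / (1.0 + abs(action - observation))
        return observation, reward

An agent that exploits the metric structure of the reals, for example by extrapolating recent observations, can learn such an environment far more easily than an arbitrary bit-string environment of similar program length.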

An ordinary numerical programming language is a good candidate basis for a formal measure of intelligence. But the real point of this paper is that distributions over environments pose complex issues for formal intelligence measures. Ultimately our definition of intelligence depends on the intuition we develop from using our minds in the physical world, and the key to a useful formal measure is the way its weighting distribution over environments abstracts from our world.

References

Hibbard, 2008. http://www.ssec.wisc.edu/~billh/g/de.pdf

Legg, S. and M. Hutter. A Formal Measure of Machine Intelligence. Proc. 15th Annual Machine Learning Conference of Belgium and The Netherlands (Benelearn 2006), pages 73-80. http://www.idsia.ch/idsiareport/IDSIA-10-06.pdf

Li, M. and P. Vitányi. An Introduction to Kolmogorov Complexity and Its Applications, 2nd ed. Springer, New York, 1997. 637 pages.

Lloyd, S. Computational Capacity of the Universe. Phys. Rev. Lett. 88 (2002) 237901. http://arxiv.org/abs/quant-ph/0110141

Wolpert, D. and W. Macready. No Free Lunch Theorems for Optimization. IEEE Transactions on Evolutionary Computation 1, 67. 1997. http://ic.arc.nasa.gov/people/dhw/papers/78.pdf
