Distribution of Environments in Formal Measures of Intelligence

Bill Hibbard
University of Wisconsin – Madison
SSEC, 1225 W. Dayton St., Madison, WI 53706

Abstract
This paper shows that a constraint on universal Turing machines is necessary for Legg's and Hutter's formal measure of intelligence to be unbiased. It also explores the relation of the No Free Lunch Theorem to formal measures of intelligence.
Introduction

A formal definition of intelligence can provide a well-defined goal to those developing artificial intelligence. Legg's and Hutter's formal measure of the intelligence of agents interacting with environments provides such a definition (Legg and Hutter 2006). Their model includes weighting distributions over time and environments. This paper argues that a constraint on the weighting over environments is required for the intelligence measure to be useful.
A Formal Measure of Intelligence

In Legg's and Hutter's measure, based on reinforcement learning, an agent interacts with its environment at a sequence of discrete times, sending action a_i to the environment and receiving observation o_i and reward r_i from the environment at time i. These are members of finite sets A, O and R respectively, where R is a set of rational numbers between 0 and 1. The environment is defined by a probability measure μ(o_k r_k | o_1 r_1 a_1 … o_{k-1} r_{k-1} a_{k-1}) and the agent is defined by a probability measure π(a_k | o_1 r_1 a_1 … o_{k-1} r_{k-1} a_{k-1}).
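The interaction protocol above can be sketched as a simple loop. This is a minimal illustration, not the paper's construction: the environment measure μ and agent measure π are modeled as hypothetical stochastic functions of the interaction history, and the sets A, O and R are made-up stand-ins.

```python
import random

A = ["left", "right"]   # finite action set (illustrative)
O = ["low", "high"]     # finite observation set (illustrative)
R = [0, 1]              # rewards drawn from rationals in [0, 1]

def mu(history):
    """Environment: sample (o_k, r_k) given the history so far."""
    return random.choice(O), random.choice(R)

def pi(history, o, r):
    """Agent: sample a_k given the history plus the latest (o_k, r_k)."""
    return random.choice(A)

def interact(steps):
    """Run the agent-environment loop, returning the reward sequence."""
    history = []            # list of (o_i, r_i, a_i) triples
    rewards = []
    for i in range(steps):
        o, r = mu(history)          # environment emits observation, reward
        a = pi(history, o, r)       # agent responds with an action
        history.append((o, r, a))
        rewards.append(r)
    return rewards

rewards = interact(10)
```

Both μ and π condition only on the visible history, matching the conditional form of the two probability measures.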
The value of agent π in environment μ is defined by the expected value of rewards:

V_μ^π = E(∑_{i=1}^∞ w_i r_i)

where the w_i ≥ 0 are a sequence of weights for future rewards subject to ∑_{i=1}^∞ w_i = 1 (Legg and Hutter combined the w_i into the r_i). In reinforcement learning the w_i are often taken to be (1-γ)γ^{i-1} for some 0 < γ < 1. Note 0 ≤ V_μ^π ≤ 1.
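A quick numerical check of the bound just stated: since the geometric weights w_i = (1-γ)γ^{i-1} sum to 1 and each r_i lies in [0, 1], any reward sequence yields 0 ≤ V_μ^π ≤ 1. The truncation length below is an arbitrary choice for illustration.

```python
gamma = 0.9
N = 10_000   # truncate the infinite sum; the tail mass is gamma**N

# Geometric weights w_i = (1-γ)γ^(i-1), i = 1..N, sum to 1 - γ^N ≈ 1.
w = [(1 - gamma) * gamma ** (i - 1) for i in range(1, N + 1)]
assert abs(sum(w) - 1.0) < 1e-6

def value(rewards, gamma):
    """Truncated value V = Σ_i w_i r_i for one sampled reward sequence."""
    return sum((1 - gamma) * gamma ** (i - 1) * r
               for i, r in enumerate(rewards, start=1))

v = value([1.0] * N, gamma)   # maximal rewards give V just below 1
assert 0.0 <= v <= 1.0
```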
The intelligence of agent π is defined by a weighted sum of its values over a set E of computable environments. Environments are computed by programs, finite binary strings, on some prefix universal Turing machine (PUTM) U. The weight for μ ∈ E is defined in terms of its Kolmogorov complexity:

K(μ) = min { |p| : U(p) computes μ }
where |p| denotes the length of program p. The intelligence of agent π is:

V^π = ∑_{μ∈E} 2^{-K(μ)} V_μ^π.

The value of this expression for V^π is between 0 and 1 because of Kraft's Inequality for PUTMs (Li and Vitányi 1997): ∑_{μ∈E} 2^{-K(μ)} ≤ 1.
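The weighted sum V^π can be illustrated over a toy environment set. The environment names, complexities K(μ), and per-environment values below are invented for illustration; in the measure itself K(μ) comes from a PUTM, which this sketch does not model.

```python
environments = {
    # name: (K(mu) in bits, V_mu^pi for some fixed agent pi) -- made up
    "constant-reward": (2, 0.90),
    "coin-flip":       (5, 0.50),
    "maze":            (9, 0.30),
}

# Each environment contributes with weight 2^(-K(mu)).
weights = {name: 2.0 ** -k for name, (k, _) in environments.items()}
assert sum(weights.values()) <= 1.0   # consistent with Kraft's Inequality

# V^pi = sum over environments of 2^(-K(mu)) * V_mu^pi
V_pi = sum(weights[name] * v for name, (_, v) in environments.items())
assert 0.0 <= V_pi <= 1.0
```

Note how the low-complexity environment dominates: it carries weight 1/4 while the 9-bit environment carries under 0.002, which is the asymmetry the next section exploits.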
Legg and Hutter state that because K(μ) is independent of the choice of PUTM up to an additive constant that is independent of μ, we can simply pick a PUTM. They do caution that the choice of PUTM can affect the relative intelligence of agents and discuss the possibility of limiting PUTM complexity. But in fact a constraint on PUTMs is necessary to avoid intelligence measures biased toward specific environments:
Proposition 1. Given μ ∈ E and ε > 0 there exists a PUTM U_μ such that for all agents π:

V_μ^π / 2 ≤ V^π < V_μ^π / 2 + ε

where V^π is computed using U_μ.
Proof. Fix a PUTM U_0 that computes environments. Given μ ∈ E and ε > 0, fix an integer n such that 2^{-n} < ε. Then construct a PUTM U_μ that computes μ given the program "1", fails to halt (alternatively, computes μ) given a program starting with between 1 and n 0's followed by a 1, and computes U_0(p) given a program of n+1 0's followed by p. Now define K using U_μ. Clearly:

2^{-K(μ)} = 1/2

And, applying Kraft's Inequality to U_0:

∑_{μ'≠μ} 2^{-K(μ')} ≤ 2^{-n} < ε.

So V^π = V_μ^π / 2 + X where X = ∑_{μ'≠μ} 2^{-K(μ')} V_{μ'}^π and 0 ≤ X < ε.
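The arithmetic behind the proof's prefix coding can be checked directly. This sketch only models program lengths under U_μ, not the machine itself: μ gets the 1-bit program "1", and every other environment's shortest program is its shortest U_0 program prepended with n+1 zeros.

```python
n = 20   # chosen so that 2^(-n) < epsilon for the epsilon of interest

def K_biased(shortest_U0_len, is_mu=False):
    """K under U_mu: 1 bit for mu itself; otherwise n+1 prefix zeros plus
    the environment's shortest U0 program length (hypothetical)."""
    return 1 if is_mu else (n + 1) + shortest_U0_len

# The program "1" computes mu, so mu's weight is exactly 2^(-1) = 1/2.
weight_mu = 2.0 ** -K_biased(0, is_mu=True)
assert weight_mu == 0.5

# Kraft's Inequality for U0 bounds the total mass of U0 programs by 1;
# prepending n+1 zeros scales that mass by 2^(-(n+1)) <= 2^(-n).
max_other_mass = 2.0 ** -(n + 1) * 1.0
assert max_other_mass <= 2.0 ** -n
```

So whatever the agent does in every other environment, the contribution X beyond V_μ^π / 2 stays below 2^{-n}.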
Whatever PUTM is used to compute environments, all but an arbitrarily small ε of an agent's intelligence is determined by its value in a finite number of environments. Lucky choices of actions at early, heavily weighted time steps in simple, heavily weighted environments may give a less intelligent agent an advantage greater than ε that a more intelligent agent cannot make up by good choices of actions in very difficult, but lightly weighted, environments.
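A toy numeric instance of this argument, with invented weights and values: the simple environment μ1 carries weight 1/2, the difficult μ2 only 2^{-10}. A "lucky" agent scoring 0.6 in μ1 then beats a better agent that scores 0.5 in μ1 but a perfect 1.0 in μ2.

```python
w_simple, w_hard = 0.5, 2.0 ** -10   # made-up environment weights

V_lucky = w_simple * 0.6 + w_hard * 0.0   # lucky early actions in mu1 only
V_smart = w_simple * 0.5 + w_hard * 1.0   # optimal play in the hard mu2

# The lucky agent's lead in mu1 (0.5 * 0.1 = 0.05) exceeds the entire
# weight 2^(-10) available in mu2, so no skill there can close the gap.
assert V_lucky > V_smart
```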
Note that as environment complexity increases, agents will require longer times to learn good actions. Thus, given a distribution of time weights that is constant over all environments, even the best agents will be unable to get any value as environment complexity