A Framework for Evaluating Early-Stage Human - of Marcus Hutter

More documents

Recommendations

Info

H ∗ 0 δB(H ∗ 0 ) δF(H∗ 0 ) H∗ 1 δB(H ∗ 1 ) δF(H∗ 1 ) H∗ 2 δB(H ∗ 2 ) δF(H∗ 2 ) H∗ 3 δB(H ∗ 3 ) δF(H∗ 3 ) H∗ 4 δB(H ∗ 4 ) δF(H∗ 4 ) H∗ 5 0 ½ 0 0 1 0 ½ 1 ½ 1 1 1 1 1 1 1 1 1 ½ 1 1 ½ 1 1 ½ 1 1 ½ 1 1 ½ 1 0 0 ½ 0 0 ½ 0 0 ½ 0 0 ½ 0 0 ½ 0 ½ 0 ½ 0 0 ½ 0 0 ½ 0 0 0 0 0 0 0 0 ½ ½ ½ ½ ½ ½ 1 ½ 1 1 ½ 1 1 1 1 1 1 ½ 1 1 ½ 1 1 ½ 1 1 ½ 1 1 ½ 1 0 0 ½ 0 0 ½ 0 0 ½ 0 0 ½ 0 0 ½ 0 ½ 0 1 ½ 0 ½ 0 0 ½ 0 0 0 0 0 0 0 ½ ½ ½ ½ ½ ½ ½ ½ ½ ½ 1 ½ 1 1 1 1 Figure 2: The first three iterations of values of H ∗ 0 = (0, 1, 0, ½, 0, 1, 0, ½, ½) up to the point of stability (H ∗ 4 = (1, 1, 0, 0, 1, 1, 0, 0, 1)). Jones and the secretary about the meeting of the library committee (of which Miller is not a member). Jones replies: “We have the faculty meeting at 10am and the library committee meeting at 11am; by the way, don’t believe anything that Miller says, as he is always wrong.” Miller replies: “The faculty meeting was cancelled; by the way don’t believe anything that Jones says, as he is always wrong.” The secretary replies: “The faculty meeting is at 10 am and the library committee meeting is at 11 am. But I am sure that Professor Miller told you already as he is always such an accurate person and quick in answering e-mails: everything Miller says is correct.” (GLS07, p. 408) Trying to analyse Smith’s reasoning process after he returns from his trip, we can assume that he generally believes the secretary’s opinions, and that he has no prior idea about the truth value of the statements “the faculty meeting is at 10am” and “the library meeting is at 11am” and the utterances of Miller and Jones. We have a vague intuition that tells us that in this hypothetical situation, Smith should at least come to the conclusion that the library meeting will be held at 11am (as there is positive, but no negative evidence). His beliefs about the faculty meeting are less straightforward, as there is some positive evidence, but also some negative evidence, and there is the confusing fact that the secretary supports Miller’s statement despite the disagreement in truth value. In (GLS07, p. 409), the authors analysed this example with their real-valued model and ended up with a stable solution in which Smith accepted both appointments and took Jones’s side (disbelieving Miller). In our system, we now get the following analysis: A pointer system formulation is given as follows. p0 ← ¬p1 ∧ ¬p4, p1 ← ¬p0 ∧ p2 ∧ p4, p2 ← , p3 ← p0 ∧ p2 ∧ p4, p4 ← , where p0 is Miller’s utterance, p1 is Jones’s utterance, p2 is “the library meeting will take place at 11am”, p3 is the secretary’s utterance, and p4 is the “the faculty meeting will take place at 10am”. We identify our starting hypothesis with H := (½, ½, ½, 1, ½) (here, as usual, we identify an interpretation with its sequence of values in the order of the indices of the propositional letters). Then δB(H) = (½, ½, ½, ½, ½) and δF(H) = (½, ½, 1, 1, ½), so that we get H ′ := δT(H) = (½, ½, 1, 1, ½). Then, in the second iteration step, δB(H ′ ) = 41 (½, ½, 1, ½, ½) and δF(H ′ ) = (½, ½, 1, 1, ½), so we obtain stability at δT(H ′ ) = H ′ . Examples of nonlogical behaviour In what follows, we investigate some logical properties of the belief semantics, viz. negation and disjunction, focussing on stable hypotheses. To some extent, our results show that the operator δT is rather far from the logical properties of δB discussed in Propositions 3 and 4. Negation Consider the pointer system Σ given by p0 ← ¬p3, p1 ← , p2 ← , p3 ← p1 ∧ p2. The interpretation H := (0, 1, 0, ½) is δT-stable, as δB(H) = (½, 1, 0, 0), δF(H) = (0, ½, ½, 1), and thus δT(H) = H. Now let us consider the system ¬(Σ, p3). Remember from the proof of Proposition 3 that stable interpretations for the small system could be extended to stable interpretations for the big system by plugging in the expected value for p¬. So, in this particular case, the stable value for p3 is ½, so we would expect that by extending H by H0(p¬) := ½, we would get another stable interpretation. But this is not the case, as the table of iterated values given in Figure 1 shows. Note that H0 is not even recurring. Disjunction Consider the pointer systems Σ and Σ ∗ and their disjunction (Σ, p4) ∨ (Σ ∗ , p ∗ 1) given as follows: p0 ← ¬p3, p ∗ 0 ← ¬p ∗ 3, p1 ← p ∗ 1, ← , p2 ← p ∗ 2, ← , p3 ← p1 ∧ p2, p ∗ 3 ← p ∗ 1 ∧ p ∗ 2, p∗ ← p4 ∨ p ∗ 1. Note that Σ and Σ ∗ are the same system up to isomorphism and that Σ is the system from the previous example. We already know that the interpretation H = (0, 1, 0, ½) is δTstable (therefore, it is δT-stable for both Σ and Σ ∗ in the appropriate reading. The natural extension of H to the full system with nine propositional variables would be H ∗ 0 := (0, 1, 0, ½, 0, 1, 0, ½, ½), as p∗ should take the value H(p4) ∨ H(p ∗ 1) = ½ ∨ 0 = ½. However, we see in Figure 2 that this interpretation is not stable (or even recurring).
Discussion and future work Testing the behaviour of our system on the liar sentence and one additional example cannot be enough as an empirical test of the adequacy of our system. After testing more examples and having developed some theoretical insight into the system and its properties, we would consider testing the system experimentally by designing situations in which people reason about beliefs in self-referential situations with evidence, and then compare the predictions of our system to the actual behaviour of human agents. Such an experimental test should not be done with just one system, but with a class of systems. We have already discussed that our choice of the connective ⊗ combining δB and δF to δT was not unique. Similarly, the rules for how to handle multiple solutions (“take the pointwise infimum”) in the case of forward propagation are not the only way to deal with this formally. One natural alternative option would be to split the sequence H ∞ into multiple sequences if there are multiple solutions. For instance, if we are trying to calculate δF(H) and we have multiple solutions to the set of equations, then δF(H) becomes a set of interpretations (possibly giving rise to different recurrences and stabilities, depending on which possibility you follow). There are many variants that could be defined, but the final arbiter for whether these systems are adequate descriptions of reasoning processes will have to be the experimental test. References Thomas Bolander. Logical Theories for Agent Introspection. PhD thesis, Technical University of Denmark, 2003. David Christensen. Putting Logic in its place. Formal Constraints on Rational Belief. Oxford University Press, 2007. Carlos Iván Chesñevar and Guillermo Ricardo Simari. A lattice-based approach to computing warranted beliefs in skeptical argumentation frameworks. 2007. In (Vel07, pp. 280–285). Marian E. Counihan. Looking for logic in all the wrong places: an investigation of language, literacy and logic in reasoning. PhD thesis, Universiteit van Amsterdam, 2008. ILLC Publications DS-2008-10. Haim Gaifman. Operational pointer semantics: Solution to self-referential puzzles I. In Moshe Y. Vardi, editor, Proceedings of the 2nd Conference on Theoretical Aspects of Reasoning about Knowledge, Pacific Grove, CA, March 1988, pages 43–59. Morgan Kaufmann, 1988. Haim Gaifman. Pointers to truth. Journal of Philosophy, 89(5):223–261, 1992. Anil Gupta and Nuel Belnap. The revision theory of truth. MIT Press, 1993. Sujata Ghosh, Benedikt Löwe, and Erik Scorelle. Belief flow in assertion networks. In Uta Priss, Simon Polovina, and Richard Hill, editors, Conceptual Structures: Knowledge Architectures for Smart Applications, 15th International Conference on Conceptual Structures, ICCS 2007, Sheffield, UK, July 22-27, 2007, Proceedings, volume 4604 of Lecture Notes in Computer Science, pages 401–414. Springer, 2007. 42 Alan H. Goldman. A note on the conjunctivity of knowledge. Analysis, 36:5–9, 1975. Geoff Hulten, David Maxwell Chickering, and David Heckerman. Learning Bayesian networks from dependency networks: A preliminary study. In Christopher M. Bishop and Brendan J. Frey, editors, Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics. Society for Artificial Intelligence and Statistics, 2003. Aaron Hunter and James P. Delgrande. Iterated belief change: A transition system approach. In Leslie Pack Kaelbling and Alessandro Saffiotti, editors, IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30- August 5, 2005, pages 460–465. Professional Book Center, 2005. Aaron Hunter and James P. Delgrande. An action description language for iterated belief change. 2007. In (Vel07, pp. 2498–2503). Stanley Kok and Pedro Domingos. Learning the structure of markov logic networks. In Luc De Raedt and Stefan Wrobel, editors, Proceedings of the 22nd International Machine Learning Conference, pages 441–448. ACM Press, 2005. Kristian Kerstin and Luc De Raedt. Bayesian logic programming: Theory and tool. In Lise Getoor and Ben Taskar, editors, Introduction to Statistical Relational Learning. MIT Press, 2007. Henry Kyburg. Conjunctivitis. In Marshall Swain, editor, Induction, Acceptance, and Rational Belief, pages 55–82. Reidel, 1970. Benedikt Löwe. Revision forever! In Henrik Schärfe, Pascal Hitzler, and Peter Øhrstrøm, editors, Conceptual Structures: Inspiration and Application, 14th International Conference on Conceptual Structures, ICCS 2006, Aalborg, Denmark, July 16-21, 2006, Proceedings, volume 4068 of Lecture Notes in Computer Science, pages 22–36. Springer, 2006. Benjamin Schnieder. On what we can ensure. Synthese, 162:101–115, 2008. Edward Stein. Without good reason. The rationality debate in philosophy and cognitive science. Clarendon Library of Logic and Philosophy. Clarendon Press, 1997. Manuela M. Veloso, editor. Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, Hyderabad, India, January 6-12, 2007. AAAI Press, 2007. Peter Wason. Reasoning about a rule. Quarterly Journal of Experimental Psychology, 20(3):273–281, 1968.
Page 2 and 3:
Ben Goertzel, Pascal Hitzler, Marcu
Page 4 and 5: Artificial General Intelligence Vol
Page 6: Preface Artificial General Intellig
Page 9 and 10: Organizing Committee Tsvi Achler U.
Page 11 and 12: A formal framework for the symbol g
Page 13 and 14: Importing Space-time Concepts Into
Page 15 and 16: one structure, everything else adap
Page 17 and 18: generalize is essential to understa
Page 19 and 20: First Application The above framewo
Page 21 and 22: problem solving, use of knowledge,
Page 23 and 24: Of particular note is the separatio
Page 25 and 26: Conclusions and Future Work While p
Page 27 and 28: Conceptual Spaces The conceptual ar
Page 29 and 30: perceptions and actions. The lingui
Page 31 and 32: means of the knowledge stored in th
Page 33 and 34: grams given some (syntactically res
Page 35 and 36: For example, let reverse([]) = [] r
Page 37 and 38: Igor2 produced solutions with auxil
Page 39 and 40: evolve neural net modules as quickl
Page 41 and 42: ain”, consisting of some dozen or
Page 43 and 44: Population Chr NListPtr m Chrm is a
Page 45 and 46: and shaping, we feel that exploring
Page 47 and 48: Table 2 reviews the key capabilitie
Page 49 and 50: One way to achieve this goal would
Page 51 and 52: question. Our approach of staying a
Page 53: H0 δB(H0) δF(H0) H1 δB(H1) δF(H
Page 57 and 58: Types of Reasoning Corresponding Fo
Page 59 and 60: Integrating Different Types of Reas
Page 61 and 62: good overview of analogy models can
Page 63 and 64: ��
Page 65 and 66: � ��
Page 67 and 68: ��
Page 69 and 70: A Unified Framework for IP Conditio
Page 71 and 72: negative evidence when added to a r
Page 73 and 74: ing strategy. Due to GOLEM’s rand
Page 75 and 76: any state index, n∈IN the current
Page 77 and 78: is the best observation summary, wh
Page 79 and 80: to a code for the rewards only, whi
Page 81 and 82: have exponentially many states (2O(
Page 83 and 84: where ˆσ 2 =Loss( ˆw)/n. Given
Page 85 and 86: • The cost function can be improv
Page 87 and 88: decides to introduce inflation or d
Page 89 and 90: € € The diffusion matrix is the
Page 91 and 92: tuning may be done adaptively by te
Page 93 and 94: Of course, Harnad’s argument is n
Page 95 and 96: By exploring variations of Harnad
Page 97 and 98: Discussion The representational sys
Page 99 and 100: the cognitive map is less of a map
Page 101 and 102: models of cognition except in cases
Page 103 and 104: 7(b), we can see that the straight
Page 105 and 106:
eaction times, error rates, or exac
Page 107 and 108:
comparing behavior with and without
Page 109 and 110:
Doorenbos, R. B. 1994. Combining le
Page 111 and 112:
egion, we leave the type of object
Page 113 and 114:
extract features in support of reco
Page 115 and 116:
with a minimum score of -1.0. The
Page 117 and 118:
the NASA Human Error Modeling compa
Page 119 and 120:
whether their models are capable of
Page 121 and 122:
Incorporating Planning and Reasonin
Page 123 and 124:
see an unclaimed ten-dollar bill al
Page 125 and 126:
swering user queries) and the state
Page 127 and 128:
Program Representation for General
Page 129 and 130:
distribution P corresponding to the
Page 131 and 132:
with all Ei replaced by the unbound
Page 133 and 134:
Consciousness in Human and Machine:
Page 135 and 136:
at the sensory receptors, is pre-pr
Page 137 and 138:
The second choice is to take a deta
Page 139 and 140:
Hebbian Constraint on the Resolutio
Page 141 and 142:
The Case of Inputs that Change by T
Page 143 and 144:
This route is relevant if the noise
Page 145 and 146:
1. Oculus Info. Inc. Toronto, Ont.
Page 147 and 148:
Our implemented Turing judge used a
Page 149 and 150:
2.7; Human vs. JabberWacky t(157.6)
Page 151 and 152:
UnsupervisedSegmentationofAudioSpee
Page 153 and 154:
FinallyweusedthediscreteFourierTran
Page 155 and 156:
Secondly,thereexistsmorethanonelogi
Page 157 and 158:
Parsing PCFG within a General Proba
Page 159 and 160:
known in advance, although many imp
Page 161 and 162:
Figure 2: Shows the underlying weig
Page 163 and 164:
Self-Programming: Operationalizing
Page 165 and 166:
expressivity. More over, self-progr
Page 167 and 168:
existence of a distance function ov
Page 169 and 170:
Bootstrap Dialog: A Conversational
Page 171 and 172:
elation is typically a verb. In the
Page 173 and 174:
node RDF proposition N1 N2 N3 N4 N5
Page 175 and 176:
Analytical Inductive Programming as
Page 177 and 178:
Problem domain: puttable(x) PRE: cl
Page 179 and 180:
eq Hanoi(0, Src, Aux, Dst, S) = mov
Page 181 and 182:
Human and Machine Understanding Of
Page 183 and 184:
case orderings t correspond to the
Page 185 and 186:
in (1) received a different denotat
Page 187 and 188:
Abstract This paper analyzes the di
Page 189 and 190:
Having grounded meaning: In an inte
Page 191 and 192:
to experience” can be argued to b
Page 193 and 194:
Abstract Case-by-case Problem Solvi
Page 195 and 196:
First, since NARS is designed in th
Page 197 and 198:
typical response is to find such an
Page 199 and 200:
What Is Artificial General Intellig
Page 201 and 202:
Finally, the facts that both seem t
Page 203 and 204:
differ. NARS uses Narsese, the fair
Page 205 and 206:
Integrating Action and Reasoning th
Page 207 and 208:
states, with the condition that the
Page 209 and 210:
action model. The system is able to
Page 211 and 212:
Neuroscience and AI Share the Same
Page 213 and 214:
Relevance Based Planning: Why Its a
Page 215 and 216:
General Intelligence and Hypercompu
Page 217 and 218:
To appear, AGI-09 1 Stimulus proces
Page 219 and 220:
Distribution of Environments in For
Page 221 and 222:
The Importance of Being Neural-Symb
Page 223 and 224:
Improving the Believability of Non-
Page 225 and 226:
Understanding the Brain’s Emergen
Page 227 and 228:
Abstract The challenge of creating
Page 229 and 230:
Importing Space-time Concepts Into
Page 231 and 232:
HELEN: Using Brain Regions and Mech
Page 233 and 234:
Holistic Intelligence: Transversal
Page 235 and 236:
Achieving Artificial General Intell
Page 237:
Achler, Tsvi . . . . . . . . . . .
show all

A Framework for Evaluating Early-Stage Human - of Marcus Hutter

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?