A Framework for Evaluating Early-Stage Human - of Marcus Hutter

Table 2: Overview of systems' restriction bias

System    F(i_1, ..., i_n) / lhs    rhs                               v_i / u_i
ADATE     i_i ∈ X_T                 T_C(X_T)                          ilc
FLIP      CRG of E+                 inverse narrowing of CRG of E+    —
FFOIL     i_i ∈ X_T                 T_C(X_T) ∪ {true, false}          il
GOLEM     i_i ∈ T_C(X_T)            T_C(X_T) ∪ {true, false}          ilc
IGOR I    i_i ∈ T_C(X_T)            T_C(X_T)                          —
IGOR II   i_i ∈ T_C(X_T)            T_C(X_T)                          i
MAGH.     composition of functions from BK, higher-order via paramorphisms

Legend: i = if, l = let, c = case

base case, where the foil gain may be misleading. FFOIL is heavily biased towards constructing the next clause to cover the most frequent function value in the remaining tuples. Where FFOIL has only very general lhss, GOLEM and FLIP try to be more flexible in discriminating the inputs, but are not effective enough. Random sampling is too unreliable for an optimal partition of the inputs, especially for more complex data structures or programs with many rules.

FLIP generates the lhss using the CRG on the basis of common subterms on the lhs and rhs of the examples. Necessary function-carrying subterms on both sides may be generalised, so the lhss tend to be overly general. Moreover, overlap of the lhss is not prohibited, and the rules are not ordered. Consequently, one input may be matched by several rules, resulting in a wrong output. The rhss are constructed via inverse narrowing, which induces a huge search space, so with increasing complexity of the examples the search becomes more and more intractable when relying solely on heuristics.

MAGICHASKELLER is a promising example of including higher-order features in IP and shows how functions like map or filter can be applied effectively, when used advisedly, as a kind of program pattern or scheme. Nevertheless, MAGICHASKELLER and ADATE exhibit the usual pros and cons common to all search-based approaches: the more extensive the BK library, the more powerful the synthesised programs, but also the greater the search space and the longer the runs.
However, in contrast to GOLEM, it is not misled by partial solutions and shows again that only a complete search can be satisfactory for IP. IGOR I and IGOR II will have problems where many examples are required (mult/add and allodds), but will otherwise be very fast.

Empirical Results

Table 3: Systems' runtimes on different problems in seconds

Problem    ADATE    FFOIL     FLIP        GOLEM     IGOR I    IGOR II   MAGH.
lasts      365.62   0.7 ⊥     ×           1.062     0.051     5.695     19.43
last       1.0      0.1       0.020       < 0.001   0.005     0.007     0.01
member     2.11     0.1 ⊥     17.868      0.033     —         0.152     1.07
odd/even   —        < 0.1 ⊥   0.130       —         —         0.019     —
multlast   5.69     < 0.1     448.900 ⊥   < 0.001   0.331     0.023     0.30
isort      83.41    ×         ×           0.714     —         0.105     0.01
reverse    30.24    —         —           —         0.324     0.103     0.08
weave      27.11    0.2       134.240 ⊥   0.266     0.001 ⊥   0.022     ⊙
shiftr     20.14    < 0.1 ⊥   448.550 ⊥   0.298     0.041     0.127     157.32
mult/add   —        8.1 ⊥     ×           —         —         ⊙         —
allodds    466.86   0.1 ⊥     ×           0.016 ⊥   0.015 ⊥   ⊙         ×

Legend: — not tested, × stack overflow, ⊙ time out, ⊥ wrong

As problems we have chosen some of those occurring in the corresponding papers and some that bring out the specific strengths and weaknesses of the systems. They have the usual semantics on lists: multlast replaces all elements with the last, and shiftr makes a right-shift of all elements in a list. Both therefore need to access the last element for further computations. Further functions are lasts, which applies last to a list of lists; isort, which is insertion sort; allodds, which checks for odd numbers; and weave, which alternates elements from two lists into one. For odd/even and mult/add both functions need to be learnt at once. The functions in odd/even are mutually recursive and need more than two rules; lasts, multlast, isort, reverse, mult/add and allodds suggest the use of function invention, but only reverse is explicitly solvable only with it. lasts and allodds also split up into more than two rules if no function invention is applied. To solve member, pattern matching is required, because equality is not provided.
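To make the informal descriptions of the benchmark problems concrete, the following Python sketch gives one possible reading of the target functions. It is an illustration only, not the specification used in the evaluation: the function bodies, the treatment of empty lists, the reading of shiftr as moving the last element to the front, the reading of allodds as "every element is odd", and the behaviour of weave on lists of unequal length are all assumptions.

```python
def last(xs):
    """Return the last element of a non-empty list."""
    return xs[-1]


def multlast(xs):
    """Replace every element with the last element (assumed: [] maps to [])."""
    return [xs[-1]] * len(xs) if xs else []


def shiftr(xs):
    """Right-shift, read here as moving the last element to the front (assumption)."""
    return [xs[-1]] + xs[:-1] if xs else []


def lasts(xss):
    """Apply last to every list in a list of non-empty lists."""
    return [last(xs) for xs in xss]


def allodds(xs):
    """Read here as: True iff every number in the list is odd (assumption)."""
    return all(x % 2 == 1 for x in xs)


def weave(xs, ys):
    """Alternate elements of two lists by swapping the arguments on each call."""
    if not xs:
        return list(ys)
    return [xs[0]] + weave(ys, xs[1:])


def even(n):
    """Mutually recursive with odd, for natural numbers n >= 0."""
    return True if n == 0 else odd(n - 1)


def odd(n):
    """Mutually recursive with even, for natural numbers n >= 0."""
    return False if n == 0 else even(n - 1)
```

Both multlast and shiftr read `xs[-1]`, which is the access to the last element that the problem descriptions call for. The weave shown here is only one of several possible formulations.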
The function weave is especially interesting, because it requires either iterating over more than one argument, resulting in more than one base case, or swapping the arguments at each recursive call.

Because FFOIL and GOLEM usually perform better with more examples, whereas FLIP, MAGICHASKELLER and IGOR II do better with fewer, each system got as many examples as necessary up to a certain complexity, but then exhaustively, so no specific cherry-picking was allowed. For synthesising isort, all systems had a function to insert into a sorted list and the predicate < as background knowledge. FLIP needed an additional function if to relate the insert function with the
ing strategy. Due to GOLEM’s random sampling, the best result of ten runs has been chosen. The long run times and the failures of FLIP testify to the intractable search space induced by the inverse narrowing operator. Its wrong programs are due to overlapping lhss and its generalisation strategy for the inputs. Despite its randomisation, GOLEM outperforms FLIP due to its capability of introducing let-expressions (cf. multlast). IGOR I and IGOR II need function invention to compensate for this weak point.

On reverse and isort, MAGICHASKELLER demonstrates the power of higher-order functions. Although it does not invent auxiliary functions, reverse was solved using its paramorphism over lists, which provides a kind of accumulator. The paramorphisms are also the reason why MAGICHASKELLER fails on weave, since swapping the inputs with each recursive call does not fit the schema induced by the paramorphism for lists.

These results show that the field of IP is not yet fully researched: improvements have been made since the “golden times” of ILP, and more are still to be discovered. Basically, ILP systems need a vast number of I/O examples, which is usually impractical for a normal user to provide. In contrast, IFP systems get along with far fewer examples but are still much more reliable in their results than ILP systems. Among the IFP systems, it is significant that analytic approaches outperform ADATE or MAGICHASKELLER on more complex problems where the search space increases. Also, the ability to invent functions generically is a big advantage.

Conclusions and Further Work

Based on a uniform description of some well-known IP systems and as a result of our empirical evaluation of IP systems on a set of representative sample problems, we could show that the analytical approach of IGOR II is highly promising. IGOR II can induce a large scope of recursive programs, including mutual recursion, using a straightforward technique for function invention.
Background knowledge can be provided in a natural way. As a consequence of IGOR II’s generalisation principle, induced programs are guaranteed to terminate and to be the least generalisations. Although the construction of hypotheses is not restricted by greedy heuristics, induction is highly time-efficient. Furthermore, IGOR II works with minimal information provided by the user: it needs only a small set of positive I/O examples together with the data type specification of the target function, and no further information such as schemes.

Due to the nature of specification by example, IP systems in general cannot scale up to complex software development problems. However, integrating IP principles into the software engineering process might relieve developers from specifying or coding specific functions. Although IGOR II cannot tackle problems of complex size, it can tackle problems which are intellectually complex and therefore might offer support to inexperienced programmers.

Function invention for the outermost function without prior definition of the positions of recursive calls will be our greatest future challenge. Furthermore, we plan to include the introduction of let-expressions and higher-order functions (such as map, reduce, filter).
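The empirical discussion notes that MAGICHASKELLER solved reverse through a paramorphism over lists that provides a kind of accumulator. As a rough illustration of that idea, the sketch below defines a list paramorphism in Python and uses it to express reverse. The names `para` and `reverse` and the concrete encoding are assumptions made for this example. This is not the program MAGICHASKELLER actually produced.

```python
from typing import Callable, List, TypeVar

A = TypeVar("A")
B = TypeVar("B")


def para(step: Callable[[A, List[A], B], B], base: B, xs: List[A]) -> B:
    """List paramorphism: like a right fold, but step also sees the remaining tail."""
    if not xs:
        return base
    head, tail = xs[0], xs[1:]
    return step(head, tail, para(step, base, tail))


def reverse(xs: List[A]) -> List[A]:
    """Reverse a list with para by folding into a function that awaits an accumulator."""
    build = para(
        # k is the function built for the tail; prepend x to the accumulator first.
        lambda x, _tail, k: (lambda acc: k([x] + acc)),
        lambda acc: acc,  # empty list: hand back the accumulator unchanged
        xs,
    )
    return build([])
```

Because `para` always recurses on the tail of the same single list, a definition that swaps two list arguments on every call does not fit this schema directly, which matches the explanation given for the failure on weave.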