
Here - Agents Lab - University of Nottingham


14. Džeroski, S., Raedt, L.D., Driessens, K.: Relational reinforcement learning. Machine Learning 43 (2001) 7–52, DOI 10.1023/A:1007694015589
15. Bordini, R.H., Dix, J., Dastani, M., Seghrouchni, A.E.F.: Multi-Agent Programming: Languages, Platforms and Applications. Volume 15 of Multiagent Systems, Artificial Societies, and Simulated Organizations. Springer (2005)
16. Bordini, R.H., Dix, J., Dastani, M., Seghrouchni, A.E.F.: Multi-Agent Programming: Languages, Tools and Applications. Springer (2009)
17. Broekens, J., Hindriks, K., Wiggers, P.: Reinforcement Learning as Heuristic for Action-Rule Preferences. In: Programming Multi-Agent Systems (ProMAS). (2010)
18. Hindriks, K.V., Riemsdijk, M.B.: Declarative Agent Languages and Technologies VI. Springer-Verlag (2009) 215–232
19. Bellman, R.E.: Dynamic Programming. Princeton University Press (1957)
20. Watkins, C.J.: Learning from delayed rewards. PhD thesis, King's College London (1989)
21. Andre, D., Russell, S.J.: State abstraction for programmable reinforcement learning agents. In: Eighteenth National Conference on Artificial Intelligence, Menlo Park, CA, USA, American Association for Artificial Intelligence (2002) 119–125
22. Pokahr, A., Braubach, L., Lamersdorf, W.: Jadex: A BDI reasoning engine. In: Multi-Agent Programming. Volume 15 of Multiagent Systems, Artificial Societies, and Simulated Organizations. Springer (2005) 149–174
23. Subagdja, B., Sonenberg, L., Rahwan, I.: Intentional learning agent architecture. Autonomous Agents and Multi-Agent Systems 18 (2009) 417–470
24. Singh, D., Sardina, S., Padgham, L.: Extending BDI plan selection to incorporate learning from experience. Robotics and Autonomous Systems 58 (2010) 1067–1075
25. Singh, D., Sardina, S., Padgham, L., Airiau, S.: Learning context conditions for BDI plan selection. In: Proceedings of Autonomous Agents and Multi-Agent Systems (AAMAS). (May 2010) 325–332
26. Singh, D., Sardina, S., Padgham, L., James, G.: Integrating learning into a BDI agent for environments with changing dynamics. In Walsh, T., ed.: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), Barcelona, Spain, AAAI Press (July 2011) 2525–2530
27. Anderson, J., Bothell, D., Byrne, M., Douglass, S., Lebiere, C., Qin, Y.: An integrated theory of the mind. Psychological Review 111(4) (2004) 1036
28. Fu, W., Anderson, J.: From recurrent choice to skill learning: A reinforcement-learning model. Journal of Experimental Psychology: General 135(2) (2006) 184
29. Klahr, D., Langley, P., Neches, R.: Production System Models of Learning and Development. The MIT Press (1987)
30. Laird, J., Rosenbloom, P., Newell, A.: Chunking in Soar: The anatomy of a general learning mechanism. Machine Learning 1(1) (1986) 11–46
31. Nason, S., Laird, J.: Soar-RL: Integrating reinforcement learning with Soar. Cognitive Systems Research 6(1) (2005) 51–59
32. Nejati, N., Langley, P., Konik, T.: Learning hierarchical task networks by observation. In: International Conference on Machine Learning, ACM Press (2006) 665–672
33. Slaney, J., Thiébaux, S.: Blocks world revisited. Artificial Intelligence 125(1–2) (2001) 119–153
34. Gupta, N., Nau, D.: On the complexity of blocks-world planning. Artificial Intelligence 56(2–3) (1992) 223–254
35. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. The Journal of Machine Learning Research 3 (2003) 1157–1182
