Doctoral Thesis - Robotics Lab - Universidad Carlos III de Madrid

[…] “pushes” the robot to avoid dangerous situations. The details and advantages of these emotions in a real robot are shown empirically throughout this book.

The robot decides its future actions based on what it has learned from past experience. Although the robot's current context is limited to a laboratory, the social robot lives alongside people in a potentially non-deterministic environment. The robot is equipped with a repertoire of actions but, initially, it knows neither which action to execute nor when to execute it. In fact, it has to learn the behavior policy, that is, which action to execute in each configuration of the world (in each state) in order to satisfy the need related to the highest motivation. Since the robot learns in a real environment by interacting with different objects, this learning must be completed in an acceptable amount of time.

The learning algorithm used is a variation of the well-known Q-Learning, called Object Q-Learning. With this algorithm the robot learns the value of each state-action pair through interaction with the environment; that is, it learns the value of each action in each possible state. The higher the value, the better the action is in that state. At the start of the learning process these values, called Q-values, may all be equal or may be assigned different values. In the first case the robot has no prior knowledge; in the second, it has some information about which action to choose. These values are updated during learning.

The emotion of fear is studied in particular depth in this thesis. The way this emotion is generated (the appraisal) and the reactions to fear prove very useful for endowing the robot with an adaptable and reliable survival mechanism. This thesis presents a social robot that uses a particular process to learn new “releasers” of fear, that is, it is able to identify new dangerous situations. In addition, through the decision making system, the robot learns different reactions to protect itself from possible harm caused by various unpredictable events. In fact, these fear reactions are quite similar to those observed in nature.

Another important challenge is the design of the solution: the decision making system has to be designed so that it is flexible enough to allow its configuration to be changed easily, or even to be applied to different robots.

Given the bio-inspired approach of this work, this research (like many other related works) arises as an attempt to understand a little more about what happens in the brain. The author hopes that this thesis can help in the study of mental processes, in particular those that may lead to mental or cognitive disorders.
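The Q-value learning described in the summary above can be illustrated with a minimal sketch. Note that this is the standard tabular Q-Learning update, not the thesis's Object Q-Learning variation, whose details are not given in this excerpt. The function names, the state and action labels, and the values of the learning rate `alpha`, the discount factor `gamma`, and the exploration rate `epsilon` are all assumptions chosen for illustration.

```python
import random
from collections import defaultdict


def make_q_table(initial_value=0.0, priors=None):
    """Build a Q-table keyed by (state, action).

    With no priors every pair starts at the same value (no prior knowledge);
    with priors, selected pairs start at different values (some prior
    information about which action to choose).
    """
    q = defaultdict(lambda: initial_value)
    if priors:
        q.update(priors)
    return q


def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Apply one standard Q-Learning update and return the new Q-value."""
    # Value of the best action available in the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    # Move the current estimate toward reward + discounted future value.
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


def choose_action(q, state, actions, epsilon=0.1, rng=random):
    """Epsilon-greedy choice: mostly the highest-valued action, sometimes a random one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


# Hypothetical states and actions, used only to exercise the functions.
ACTIONS = ["eat", "sleep"]
q = make_q_table()
q_update(q, "hungry", "eat", reward=1.0, next_state="sated", actions=ACTIONS)
print(choose_action(q, "hungry", ACTIONS, epsilon=0.0))
```

With `epsilon=0.0` the choice is purely greedy, so after a single rewarded update the sketch selects the rewarded action. Passing a `priors` dictionary to `make_q_table` corresponds to the second initialization case in the summary, where the robot starts with some information about which action to choose.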
Contents

Acknowledgements
Abstract
Summary
1 Introduction
1.1 Motivation
1.1.1 Cognitive robotics
1.1.2 Autonomy
1.1.3 Learning
1.2 The problem
1.3 Objectives
1.4 Overview of the contents
2 Biological foundations
2.1 Introduction
2.2 The origin of behavior
2.2.1 Innate vs learned
2.2.2 Unconscious involuntary vs conscious voluntary
2.2.3 Homeostasis
2.3 Motivated behavior
2.3.1 Hull’s drive-reduction theory
2.3.2 Motivations
