13.07.2015 Views

TESIS DOCTORAL - Robotics Lab - Universidad Carlos III de Madrid

TESIS DOCTORAL - Robotics Lab - Universidad Carlos III de Madrid

TESIS DOCTORAL - Robotics Lab - Universidad Carlos III de Madrid

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

5 The social robot Maggie and its <strong>de</strong>cision making system 875.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 875.2 The robot Maggie . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 875.3 The Automatic-Deliberative control architecture . . . . . . . . . . . . . . . 895.3.1 Deliberative level . . . . . . . . . . . . . . . . . . . . . . . . . . . 905.3.2 Automatic level . . . . . . . . . . . . . . . . . . . . . . . . . . . . 905.3.3 AD Communications . . . . . . . . . . . . . . . . . . . . . . . . . 915.3.4 AD Skill . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 915.4 Featuring Maggie’s DMS . . . . . . . . . . . . . . . . . . . . . . . . . . . 925.4.1 The robot’s inner world: what drives and motivations? . . . . . . . 935.4.2 The external world: sensing and acting . . . . . . . . . . . . . . . 995.4.3 Acting in the world: what to do next? . . . . . . . . . . . . . . . . 1045.4.4 The consequences of the robot’s actions . . . . . . . . . . . . . . . 1065.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1076 Learning to make <strong>de</strong>cisions 1096.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1096.2 Object Q-Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1096.2.1 The state space . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1106.2.2 Reduced state space . . . . . . . . . . . . . . . . . . . . . . . . . 1116.2.3 Collateral effects and Object-Q learning . . . . . . . . . . . . . . . 1136.2.4 The algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1146.3 Enhancing the learning process . . . . . . . . . . . . . . . . . . . . . . . . 1226.3.1 Well-balanced Exploration . . . . . . . . . . . . . . . . . . . . . . 1226.3.2 Amplified Reward . . . . . . . . . . . . . . . . . . . . . . . . . . 1246.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1257 Implementing the <strong>de</strong>cision making system 1277.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1277.2 Decision Making System database <strong>de</strong>sign . . . . . . . . . . . . . . . . . . 1287.3 Decision Making System class <strong>de</strong>sign . . . . . . . . . . . . . . . . . . . . 1337.3.1 The external robot’s world class <strong>de</strong>sign . . . . . . . . . . . . . . . 1367.3.2 The inner robot’s world class diagram . . . . . . . . . . . . . . . . 1387.4 How the external state is perceived . . . . . . . . . . . . . . . . . . . . . . 1417.4.1 The location monitor . . . . . . . . . . . . . . . . . . . . . . . . . 1437.4.2 The music player sensor . . . . . . . . . . . . . . . . . . . . . . . 1457.4.3 The docking station sensor . . . . . . . . . . . . . . . . . . . . . . 1457.4.4 The bluetooth discoverer . . . . . . . . . . . . . . . . . . . . . . . 1467.4.5 The rfid discoverer . . . . . . . . . . . . . . . . . . . . . . . . . . 1497.5 How the robot interacts with the objects . . . . . . . . . . . . . . . . . . . 151xi

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!