Fasola & Matarić. A SAR Exercise Coach for the Elderly

produced by the TTS engine and locates the key start and end points of silence, which correspond to pauses in the speech; for example, due to commas or periods in the input text. Then, at the appropriate times during speech playback, the speech module sends lip movement commands to the action module to simulate the opening and closing of the robot’s mouth during speech. The lip synchronization procedure is used to enhance the naturalness of the interaction to promote user engagement with the robot. The alternative of keeping the robot’s lips closed during speech might appear incongruent to the user and thus have a disengaging effect. The speech module also communicates speech state information to the behavior module, such as whether or not it is currently speaking, so that the behavior finite state machines can make transitions accordingly (e.g., by waiting for the robot to finish giving a praise comment before demonstrating the next exercise gesture).

User Communication Module. The user communication module is responsible for receiving direct input from the user and relaying the communication attempt to the behavior module. The input in our SAR system is via a Nintendo Wiimote wireless remote control device, which is capable of sending button presses wirelessly via Bluetooth.
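A button-press relay of this kind might be sketched as follows. The button names and event labels are illustrative assumptions, not the system's actual mapping; the point is that each labeled button translates to one discrete communication event passed to the behavior module.

```python
# Hypothetical Wiimote-button-to-event mapping; names are illustrative only.
BUTTON_EVENTS = {
    "A":     "GESTURE_COMPLETE",  # report a gesture completion attempt
    "PLUS":  "ANSWER_YES",        # respond to a yes/no prompt
    "MINUS": "ANSWER_NO",
    "HOME":  "REQUEST_BREAK",     # ask for a rest break
}

def relay_button_press(button, behavior_queue):
    """Translate a button press into a communication event and relay it
    to the behavior module's event queue. Unknown buttons are ignored."""
    event = BUTTON_EVENTS.get(button)
    if event is not None:
        behavior_queue.append(event)
    return event
```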
The simplicity of the input method allows the user to request breaks during interaction, respond to yes/no questions or prompts made by the robot, and also to report key information during the exercise games (e.g., notifying the robot of a gesture completion attempt during the Memory game). Each communication type is represented by a different button specifically labeled on the remote control. This method enables simple two-way communication between the robot and user, is easy to use, and avoids the sensing difficulties often found with alternative communication input methods (e.g., speech recognition).

Behavior Module. The behavior module is responsible for producing all of the coaching behaviors of the system, including steering the interaction and exercise games, providing verbal feedback, demonstrating actions, responding to user input, monitoring user activity and progress in the task, and recording important information to the database. The behavior module therefore communicates with each of the six system modules and represents the system’s main loop. The behavior module also keeps track of important session variables, including session time and total duration, current game time, and session schedule.

Action Module. The action module is responsible for sending motor commands to the robot to execute. These include facial expressions (e.g., moving eyebrows and lips) and arm movements.
The module was designed to promote robot/agent platform independence, and as a result, all action requests made by connected system modules are only translated to the robot platform-dependent motor commands in the action module, thereby abstracting this knowledge away from all other system modules. This hierarchy allows for seamless integration of different platforms for the social agent, without the need for modification of any of the remaining software modules. In the user study presented in Section 4, we explored the use of both a physical humanoid robot and a virtual simulation of the same robot for system evaluation, thus demonstrating the interoperability of the system design.
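The platform-abstraction layer described above can be sketched roughly as follows. The class and method names, action names, and motor-command strings are all assumptions for illustration: other modules issue platform-independent action requests, and only the action module knows how to translate them for the active backend (physical robot or its virtual simulation).

```python
# Minimal sketch of platform-independent action dispatch; all names are
# illustrative assumptions, not the authors' actual interface.
class RobotPlatform:
    """Platform-dependent backend interface."""
    def send_motor_command(self, command):
        raise NotImplementedError

class PhysicalRobot(RobotPlatform):
    def send_motor_command(self, command):
        return f"serial-> {command}"   # e.g., write to a servo controller

class SimulatedRobot(RobotPlatform):
    def send_motor_command(self, command):
        return f"sim-> {command}"      # e.g., update the on-screen avatar

class ActionModule:
    # platform-independent action name -> platform motor command
    ACTION_TABLE = {
        "smile": "lips:open:20",
        "raise_left_arm": "l_shoulder:pitch:90",
    }
    def __init__(self, platform):
        self.platform = platform
    def execute(self, action):
        # Translation happens only here; callers never see motor commands.
        return self.platform.send_motor_command(self.ACTION_TABLE[action])
```

Swapping `SimulatedRobot` for `PhysicalRobot` at construction time would then require no change to any other module, which is the interoperability property the text describes.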
Database Module. The database module is responsible for storing important user-related information regarding task performance and progress, in addition to session history. Examples include the date and time of previous sessions, exercise performance in each of the exercise games, number of breaks taken, high scores, etc. This information is available for retrieval during interaction by the behavior module, which uses the information to implement continuity behaviors and for reporting user performance and progress. The database information is also useful for obtaining quantitative metrics of user performance for post-session/user study system evaluation.

3.5 User Activity Recognition

In order to monitor user performance and provide accurate feedback during the exercise routines, the robot must be able to recognize the user’s arm gestures. To accomplish this, we developed a vision algorithm that recognizes the user’s arm gestures/poses in real time, with minimal constraints on the surrounding environment and the user.

Several different approaches have been developed to accomplish tracking of human motions, both in 2D and 3D, including skeletonization methods (Fujiyoshi & Lipton, 1998; Jenkins, Chu, & Matarić, 2004), gesture recognition using probabilistic methods (Waldherr, Thrun, Romero, & Margaritis, 1998), and color-based tracking (Wren, Azarbayejani, Darrell, & Pentland, 1997), among others.
We opted to create an arm pose recognition system that takes advantage of our simplified exercise setup in order to achieve real-time results without imposing any markers on the user.

To simplify the visual recognition of the user, a black curtain was used to provide a static and contrasting background for fast segmentation of the user’s head and hands, the most important features of the arm pose recognition task, independent of the user’s skin tone. The arm pose recognition algorithm consists of the following four major steps:

1) Create segmented image: The original grayscale camera frame is converted into a black/white image by applying a single threshold over the image. All pixels below the threshold are set to black, and the rest to white. The threshold is set by the experimenter to appropriately segment out the user from the background, accounting for the specific lighting of the environment. Figure 3(a) shows an original grayscale image captured from the camera, with the segmented image shown in Figure 3(b).

2) Detect the user’s face: The OpenCV frontal face detector is used to determine the location and size of the user’s face.
With these values, an estimate for the shoulder positions on both sides of the body is made.

3) Determine hand locations: The hand locations of the user are determined by examining the extrema points of the body pixels (maximum/minimum white pixel locations along x and y directions away from the body) inside the region above the chest line and to the side of the face in the segmented image (subtracting the head and estimated body regions, represented by green and blue bounding boxes in Figure 3). The algorithm applies a simple set of rules, or heuristics, to choose which extrema points correspond to the hand location for a given arm. For example, if the highest white pixel (body pixel) in the segmented image on the left side is further than the approximate shoulder-to-forearm distance, then that is the left hand location.

These extrema point heuristics are valid for this domain, as the silhouette of a person demonstrating planar arm movements (i.e., only two degrees of freedom) will always feature at least one of these extrema points at each of the hand locations, due to the geometric properties of planar two-link manipulators. This planar two-link assumption thus validates the use of extrema point heuristics performed on a binary segmentation of the image (approximated silhouette) to detect the hand locations of the user.
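A simplified sketch of the extrema-point heuristic for one arm is shown below. The search-region bounds, the rough shoulder estimate, and the distance rule are assumptions for illustration, not the paper's exact heuristics; it shows only the "highest white pixel farther than a forearm length from the shoulder" example from step 3.

```python
# Illustrative extrema-point hand heuristic on a binary silhouette.
# Region bounds and the shoulder estimate are simplifying assumptions.
import numpy as np

def left_hand_from_extrema(binary, face_box, forearm_len):
    """binary: HxW array (255 = body pixel). face_box: (x, y, w, h) from
    the face detector. Returns (row, col) of the left-hand candidate,
    or None if no extremum passes the distance rule."""
    x, y, w, h = face_box
    # Search region: to the left of the face, above the chest line
    # (here roughly approximated as twice the face height down).
    region = binary[: y + 2 * h, : x]
    ys, xs = np.nonzero(region)
    if ys.size == 0:
        return None
    # Extremum: highest white pixel on the left side (minimum row index).
    top = ys.argmin()
    cand = (int(ys[top]), int(xs[top]))
    # Heuristic: accept only if farther from the estimated shoulder
    # than the approximate shoulder-to-forearm distance.
    shoulder = (y + h, x)  # rough left-shoulder estimate from the face box
    dist = np.hypot(cand[0] - shoulder[0], cand[1] - shoulder[1])
    return cand if dist > forearm_len else None
```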