5. The top 14 features are: F0 maximum, F0 standard deviation, F0 range, F0 mean, BW1 mean, BW2 mean, energy standard deviation, speaking rate, F0 slope, F1 maximum, energy maximum, energy range, F2 range, and F1 range.

6. The first set included the top 8 features (from F0 maximum to speaking rate), the second extended the first by the next 2 features (F0 slope and F1 maximum), and the third included all 14 top features.

7. An ensemble consists of an odd number of neural network classifiers trained on different subsets. The ensemble makes a decision by majority voting (a code sketch of this scheme follows these notes).

8. To train the experts, we used a two-layer backpropagation neural network architecture with an 8-element input vector, 10 or 20 nodes in the hidden sigmoid layer, and one node in the output linear layer (see the first sketch after these notes). We also used the same subsets of the s70 data set as training and test sets, but with only two classes (for example, angry vs. non-angry).

9. To explore this approach, we used a two-layer backpropagation neural network architecture with a 5-element input vector, 10 or 20 nodes in the hidden sigmoid layer, and five nodes in the output linear layer. We selected five of the best experts and generated several dozen neural network recognizers.

10. We created ensembles of 15 neural network recognizers for the 8-, 10-, and 14-feature inputs and the 10- and 20-node architectures. The average accuracy of the ensembles of recognizers lies in the range 73–77% and reaches its maximum of about 77% for the 8-feature input and the 10-node architecture.
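To make the expert architecture of note 8 concrete, here is a minimal sketch of a two-layer backpropagation network with an 8-element input, a 10-node sigmoid hidden layer, and one linear output node. The loss function (mean-squared error), the learning rate, and the training data are all assumptions: the chapter does not specify them, and the random matrices below are placeholders for the s70 feature subsets, which are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Placeholder stand-ins for the s70 feature subsets (not given in the text):
# 8 prosodic features per utterance, binary labels (e.g. angry vs. non-angry).
X = rng.normal(size=(70, 8))
y = rng.integers(0, 2, size=(70, 1)).astype(float)

n_hidden = 10                      # the notes use 10 or 20 hidden nodes
W1 = rng.normal(scale=0.1, size=(8, n_hidden))
b1 = np.zeros(n_hidden)
W2 = rng.normal(scale=0.1, size=(n_hidden, 1))
b2 = np.zeros(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 0.05
for epoch in range(2000):
    # Forward pass: sigmoid hidden layer, linear output node.
    h = sigmoid(X @ W1 + b1)
    out = h @ W2 + b2
    # Backpropagation of the mean-squared error (an assumed loss).
    err = out - y                          # dL/dout for 0.5 * MSE
    grad_W2 = h.T @ err / len(X)
    grad_b2 = err.mean(axis=0)
    dh = (err @ W2.T) * h * (1.0 - h)      # chain rule through the sigmoid
    grad_W1 = X.T @ dh / len(X)
    grad_b1 = dh.mean(axis=0)
    for p, g in ((W1, grad_W1), (b1, grad_b1), (W2, grad_W2), (b2, grad_b2)):
        p -= lr * g

pred = (sigmoid(X @ W1 + b1) @ W2 + b2 > 0.5).astype(int)
print("training accuracy:", (pred == y).mean())
```

Setting `n_hidden = 20` gives the alternative 20-node architecture mentioned in the notes; the five-output recognizer of note 9 follows the same pattern with a 5-element input and five linear output nodes.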
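Notes 7 and 10 combine such experts into odd-sized ensembles decided by majority vote. The sketch below assumes a hypothetical interface in which each ensemble member is a function mapping a feature matrix to a vector of 0/1 labels; the 15-member example mirrors the ensembles of note 10, and the toy linear voters are placeholders rather than the trained recognizers.

```python
import numpy as np

def majority_vote(members, X):
    """Combine an odd number of binary classifiers by majority voting.

    `members` is a list of predict-functions mapping a feature matrix
    to 0/1 label vectors (an assumed interface; the chapter gives none).
    """
    assert len(members) % 2 == 1, "note 7 requires an odd number of voters"
    votes = np.stack([m(X) for m in members])   # shape (n_members, n_samples)
    # A sample is labeled 1 when more than half of the members vote 1.
    return (votes.sum(axis=0) > len(members) // 2).astype(int)

# Usage with 15 toy members, echoing the 15-recognizer ensembles of note 10.
rng = np.random.default_rng(1)
X = rng.normal(size=(20, 8))
members = [
    (lambda X, w=rng.normal(size=8): (X @ w > 0).astype(int))
    for _ in range(15)
]
print(majority_vote(members, X))
```

Keeping the member count odd, as note 7 stipulates, guarantees that a majority always exists for a binary decision, so no tie-breaking rule is needed.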
