2021
DOI: 10.36227/techrxiv.17104526
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Mutual impact of acoustic and linguistic representations for continuous emotion recognition in call-center conversations

Abstract: <div> <div> <div> <p>The goal of our research is to automaticaly retrieve the satisfaction and the frustration in real-life call-center conversations. This study focuses an industrial application in which the customer satisfaction is continuously tracked down to improve customer services. To compensate the lack of large annotated emotional databases, we explore the use of pre-trained speech representations as a form of transfer learning towards AlloSat corpus. Moreover, seve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 48 publications
(64 reference statements)
0
2
0
Order By: Relevance
“…Human emotion recognition by machines aids various research fields, such as virtual reality, gaming, robotics, and customer care operations. For instance, if an automated call center system is able to infer a customer's emotional state, the application may be able to provide more appropriate responses or send the call to a human operator straightaway [2]. Emotion detection systems can be used in virtual reality and games to detect a player's emotional suffering, paving the way for more realistic, engaging, and immersive gaming experiences SA 5042, Australia.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Human emotion recognition by machines aids various research fields, such as virtual reality, gaming, robotics, and customer care operations. For instance, if an automated call center system is able to infer a customer's emotional state, the application may be able to provide more appropriate responses or send the call to a human operator straightaway [2]. Emotion detection systems can be used in virtual reality and games to detect a player's emotional suffering, paving the way for more realistic, engaging, and immersive gaming experiences SA 5042, Australia.…”
Section: Introductionmentioning
confidence: 99%
“…In contrast to automated speech recognition (ASR) datasets, commonly used Speech Emotion Recognition datasets [8]- [10] are restricted in complexity and small in size. Furthermore, systems trained on these datasets may not be generalizable to other domains, such as customer service centers [2]. As a result, self-supervised pre-trained models such as wave-tovector (wav2vec) [11] and Bidirectional Encoder Representations from Transformers (BERT) [12] have been created to address the above-mentioned issue by learning from largescale audio and text datasets without the need of extensive labeling.…”
Section: Introductionmentioning
confidence: 99%