Training Socially Engaging Robots: Modeling Backchannel Behaviors with Batch Reinforcement Learning

Hussain, Nusrah; Erzin, Engin; Sezgin, T. Metin; Yemez, Y.

doi:10.1109/taffc.2022.3190233

Cited by 7 publications

(2 citation statements)

References 79 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Other methods train on hand-crafted examples through generative models [28,42]. For instance, predicting when to use backchanneling behaviors (i.e., providing feedback during conversation such as by nodding) has been learned through batch reinforcement learning [17] and recurrent neural networks [31]. Lastly, recent work has investigated how to learn cost functions for a target emotion from user feedback [49], or even learn an emotive latent space to model many emotions [40].…”

Section: Related Workmentioning

confidence: 99%

Generative Expressive Robot Behaviors using Large Language Models

Mahadevan,

Chien,

Brown

et al. 2024

Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction

View full text Add to dashboard Cite

People employ expressive behaviors to effectively communicate and coordinate their actions with others, such as nodding to acknowledge a person glancing at them or saying "excuse me" to pass people in a busy corridor. We would like robots to also demonstrate expressive behaviors in human-robot interaction. Prior work proposes rule-based methods that struggle to scale to new communication modalities or social situations, while data-driven methods require specialized datasets for each social situation the robot is used in. We propose to leverage the rich social context available from large language models (LLMs) and their ability to generate motion based on instructions or user preferences, to generate expressive robot motion that is adaptable and composable, building upon each other. Our approach utilizes few-shot chain-of-thought prompting to translate human language instructions into parametrized control code using the robot's available and learned skills. Through user studies and simulation experiments, we demonstrate that our approach produces behaviors that users found to be competent and easy to understand. Supplementary material can be found at https://generative-expressive-motion.github.io/. CCS CONCEPTS• Computing methodologies → Online learning settings.

show abstract

Section: Related Workmentioning

confidence: 99%

Generative Expressive Robot Behaviors using Large Language Models

Mahadevan,

Chien,

Brown

et al. 2024

Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction

View full text Add to dashboard Cite

show abstract

“…Ding et al [10] describes how an agent can be endowed to elicit conversations with older adults for delivering cognitive training. Similarly, Hussain et al [14] presented a method that learnt to produce non-verbal backchannels, and demonstrated how such feedback had an impact on participants' engagement. Inden et al [16] modelled five different strategies for feedback behaviour in a conversational agent and evaluated their effectiveness in a user study, showing that when the robot took into account the interlocutor's utterance and pauses, participants rated that strategy as more adequate than the others.…”

Section: Introductionmentioning

confidence: 99%

Implications of Robot Backchannelling in Cognitive Therapy

Andriella¹,

Torras²,

Alenyà³

2022

Lecture Notes in Computer Science

View full text Add to dashboard Cite

The social ability of humans to provide active feedback during conversations is known as backchannelling. Recent work has recognised the importance of endowing robots with such social behaviour to make interactions more natural. Nonetheless, very little is known about how backchannelling should be designed in order to be detected and whether it can have an impact on users' behaviour and performance in cooperative tasks. In this article, we aim at evaluating the legibility of robot's backchannelling behaviour on Persons with Dementia (PwDs) and its effect on their performance when playing cognitive training exercises. Aiming to do so, a TIAGo robot was endowed with backchannelling behaviour generated by combining verbal and non-verbal cues. To evaluate our system, two user studies were carried out, in which the social signal was provided first by a human therapist and later on by a robot. Results indicate that patients were capable of identifying such kind of feedback. Nonetheless, our findings pointed out a significant difference in terms of performance between the two studies. They reveal how patients in the study with the robot overused the feedback to obtain the correct answer, putting in place a cheating mechanism that has led them to significantly worsen their performance. We conclude our work by discussing the implications of our findings when deploying robots in sensitive roles and possible solutions to address such unexpected behaviours.

show abstract