2019 28th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) 2019
DOI: 10.1109/ro-man46459.2019.8956444
|View full text |Cite
|
Sign up to set email alerts
|

Learning Socially Appropriate Robot Approaching Behavior Toward Groups using Deep Reinforcement Learning

Abstract: Deep reinforcement learning has recently been widely applied in robotics to study tasks such as locomotion and grasping, but its application to social human-robot interaction (HRI) remains a challenge. In this paper, we present a deep learning scheme that acquires a prior model of robot approaching behavior in simulation and applies it to real-world interaction with a physical robot approaching groups of humans. The scheme, which we refer to as Staged Social Behavior Learning (SSBL), considers different stages… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 27 publications
(22 citation statements)
references
References 42 publications
0
17
0
1
Order By: Relevance
“…Later, various studies started to exploit the spatial configuration of the sensor/joint network. For instance, several data representations considered the relationships between sensors/joints [29,36,81], with network architectures designed to enable local processing of movement dynamics [13,19,39,40,44,75,80]. Performance improvements achieved by these methods suggest that body configuration information is important for activity recognition.…”
Section: Human Activity Recognitionmentioning
confidence: 99%
“…Later, various studies started to exploit the spatial configuration of the sensor/joint network. For instance, several data representations considered the relationships between sensors/joints [29,36,81], with network architectures designed to enable local processing of movement dynamics [13,19,39,40,44,75,80]. Performance improvements achieved by these methods suggest that body configuration information is important for activity recognition.…”
Section: Human Activity Recognitionmentioning
confidence: 99%
“…Lathuilière et al [11] uses Deep RL to learn a gaze policy from an intrinsic reward function based on the audiovisual position of people with respect to the robot camera field of view. Gao et al [8] learns a robot policy for approaching groups of people by maximizing a group formation score and minimizing the displacement of other participants in the group when the robot approaches.…”
Section: Generating Social Behavioursmentioning
confidence: 99%
“…Without having studies which understand how human beings interpret the behaviour of robotic systems and what are their expectations of such systems, it would be impossible, for example, to help humans trust their robot counterparts in critical and non-critical situations [15]. On this subject, Han et al [16] proposed a literature review, while interesting studies from a computer science, artificial intelligence, cognitive psychology, and philosophical [29] Automated rationale generation Recurrent neural networks [26] Goal communication, favourizing user anticipation Inverse reinforcement learning [63] Training personalized policies for teaching Model-free affective reinforcement learning Tega [67] Teaching a robot to learn toy names and the locations Reinforcement learning Kasper [42] Robot approaching behaviour to groups of humans Proximal policy optimization Pepper [35] Adaptively decide a monitoring distance and an approaching direction to improve user activity recognition performance perspective can be found in a recent full-day workshop [17]. For example, the Theory of Mind (ToM) model is used by both human and robot in order to understand each other in an autonomous car driving on the highway scenario [18].…”
Section: Explainable Behavioursmentioning
confidence: 99%
“…Online learning of repeated stochastic games has also been suggested [40] as an efficient method to learn to collaborate with people through cheap talk [41]. In Gao et al [42], proximal policy optimization (PPO) was used to learn robot approaching behaviour, while in Vitiello et al [43], the robot proxemics is adapted with a neuro-fuzzy-Bayesian system.…”
Section: Machine Learning In Hrimentioning
confidence: 99%