Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue 2015
DOI: 10.18653/v1/w15-4652
|View full text |Cite
|
Sign up to set email alerts
|

Incremental Coordination: Attention-Centric Speech Production in a Physically Situated Conversational Agent

Abstract: Inspired by studies of human-human conversations, we present methods for incrementally coordinating speech production with listeners' visual foci of attention. We introduce a model that considers the demands and availability of listeners' attention at the onset and throughout the production of system utterances, and that incrementally coordinates speech synthesis with the listener's gaze. We present an implementation and deployment of the model in a physically situated dialog system and discuss lessons learned. Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(14 citation statements)
references
References 10 publications
0
14
0
Order By: Relevance
“…If the user does not have the turn, the robot either has or takes the turn through its Robot's initiative sub-tree, and executes the presentation. Firstly, joint attention is ensured or grabbed (see Yu et al (2015)) if lost, this can be sensed in multiple ways (Ba and Odobez, 2009;Sheikhi, 2014;Szafir and Mutlu, 2012).…”
Section: Modelling the Presentationmentioning
confidence: 99%
“…If the user does not have the turn, the robot either has or takes the turn through its Robot's initiative sub-tree, and executes the presentation. Firstly, joint attention is ensured or grabbed (see Yu et al (2015)) if lost, this can be sensed in multiple ways (Ba and Odobez, 2009;Sheikhi, 2014;Szafir and Mutlu, 2012).…”
Section: Modelling the Presentationmentioning
confidence: 99%
“…The focus of studies based on these definitions was when and how the conversation starts and also ends. For example, one of the tasks was to detect the engaged person who wants to start the conversation with a situated robot [28,29]. The second type of definition is about the quality of the connection between participants during the dialogue.…”
Section: A) Definition Of Engagementmentioning
confidence: 99%
“…To realize smooth interaction, it is essential for the system to correctly identify who talks to whom and if the user is giving his/her attention toward ERICA (Yu et al, 2015). In this demonstration, we track the user's location and head orientation in the 3D space by using the Kinect v2 sensor.…”
Section: Speaker Tracking With Depth Cameramentioning
confidence: 99%