2009
DOI: 10.1007/s10458-009-9092-y
|View full text |Cite
|
Sign up to set email alerts
|

A probabilistic multimodal approach for predicting listener backchannels

Abstract: During face-to-face interactions, listeners use backchannel feedback such as head nods as a signal to the speaker that the communication is working and that they should continue speaking. Predicting these backchannel opportunities is an important milestone for building engaging and natural virtual humans. In this paper we show how sequential probabilistic models (e.g., Hidden Markov Model or Conditional Random Fields) can automatically learn from a database of human-to-human interactions to predict listener ba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
111
0
2

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 123 publications
(116 citation statements)
references
References 32 publications
3
111
0
2
Order By: Relevance
“…Furthermore, the dynamics of action and reaction have been utilized for generating the reactions of a conversational avatar to a user. For example, Morency et al [25] proposed a probabilistic model for predicting the timing of listener's head gesture by analyzing the time lag of the listener's backchannel against speaker's utterance in human-human dyad conversations. However, none of them did not address how such interactions are perceived by observers.…”
Section: Related Workmentioning
confidence: 99%
“…Furthermore, the dynamics of action and reaction have been utilized for generating the reactions of a conversational avatar to a user. For example, Morency et al [25] proposed a probabilistic model for predicting the timing of listener's head gesture by analyzing the time lag of the listener's backchannel against speaker's utterance in human-human dyad conversations. However, none of them did not address how such interactions are perceived by observers.…”
Section: Related Workmentioning
confidence: 99%
“…Similarly, the second problem may be remedied by clustering feedback (places in the speaker's speech that are followed by feedback signals from multiple listeners are more likely to contain a cue). Nevertheless, the form-features in feedback elicitation cues have proven informative enough to enable automatic detection of feedback elicitation cues in audiovisual datastreams and have been successfully used to model the feedback behaviour of virtual agents [17,20].…”
Section: Feedback Elicitationmentioning
confidence: 99%
“…So far, most research on feedback in dialogue agents has concentrated on the first of these two aspects, with particular emphasis placed on models of appropriate timing of backchannel feedback [26,20,18]. Recently, the increase in the capabilities of incremental natural language understanding, has directed attention to the question of what type of feedback should be provided [19,24,23].…”
Section: Using Human Feedback In Human-agent Dialoguementioning
confidence: 99%