Proceedings of the 20th ACM International Conference on Multimodal Interaction 2018
DOI: 10.1145/3242969.3243029
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Dialogue Management for Multiparty Interaction with Infants

Abstract: We present dialogue management routines for a system to engage in multiparty agent-infant interaction. The ultimate purpose of this research is to help infants learn a visual sign language by engaging them in naturalistic and socially contingent conversations during an early-life critical period for language development (ages 6 to 12 months) as initiated by an artificial agent. As a first step, we focus on creating and maintaining agent-infant engagement that elicits appropriate and socially contingent respons… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(14 citation statements)
references
References 36 publications
0
11
0
Order By: Relevance
“…This study was part of a larger project designed to develop a system called RAVE (Robot AVatar thermal Enhanced language learning tool). RAVE is aimed to be an augmentative learning tool that can provide linguistic input, in particular visual language inputs, to facilitate language learning during one widely recognized critical developmental period for language (ages 6-12 months [97]) [96,[98][99][100][101]. To this end, thermal IR imaging was used to determine the emotional arousal and attentional valence, providing new knowledge about when infants are most optimally "Ready to Learn", even before the onset of language production.…”
Section: Thermal Ir Imaging-based Affective Computing In Hrimentioning
confidence: 99%
“…This study was part of a larger project designed to develop a system called RAVE (Robot AVatar thermal Enhanced language learning tool). RAVE is aimed to be an augmentative learning tool that can provide linguistic input, in particular visual language inputs, to facilitate language learning during one widely recognized critical developmental period for language (ages 6-12 months [97]) [96,[98][99][100][101]. To this end, thermal IR imaging was used to determine the emotional arousal and attentional valence, providing new knowledge about when infants are most optimally "Ready to Learn", even before the onset of language production.…”
Section: Thermal Ir Imaging-based Affective Computing In Hrimentioning
confidence: 99%
“…Thus, this is well past the early critical period for learning phonological units, phonological segmentation, categorization and mapping, and sequencing distributions -all vital to optimal, healthy language learning and reading. As such, there is a pressing opportunity for AI technology that can provide signed language input in the critical period of 6-12 months [20,21,34].…”
Section: Background and Motivationmentioning
confidence: 99%
“…The RAVE system includes two behavioral agents (a physical robot and a virtual human avatar on a screen) that can provide visual behaviors, as well as several sensor devices: an eye-tracker, thermal camera, and an interface for indicating communicative baby behaviors. Detailed description of the system's constituent components and dialogue algorithms are presented in [34], and [20], respectively. A preliminary evaluation of the system has been presented in [21].…”
Section: The Rave Systemmentioning
confidence: 99%
See 1 more Smart Citation
“…While remote eye trackers allow estimating user gaze with high accuracy on computer screens, they are limited to their narrow ield-of-view and cannot be used with smaller surfaces (e.g., smartwatches). Eye tracking devices are used in experiments involving humans, even including infants [Franchak et al, 2011, Nasihati Gilani et al, 2018, that must not be harmed. Most pupil detection algorithms rely on head mounted eye cameras using an infrared light emitter attached next to the eye camera in order to seamlessly extract the pupil from the iris area in the imaging frame ( Figure 1).…”
mentioning
confidence: 99%