2020
DOI: 10.48550/arxiv.2010.15225
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Visuospatial Dataset for Naturalistic Verb Learning

Abstract: We introduce a new dataset for training and evaluating grounded language models. Our data is collected within a virtual reality environment and is designed to emulate the quality of language data to which a pre-verbal child is likely to have access: That is, naturalistic, spontaneous speech paired with richly grounded visuospatial context. We use the collected data to compare several distributional semantics models for verb learning. We evaluate neural models based on 2D (pixel) features as well as feature-eng… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 18 publications
0
1
0
Order By: Relevance
“…Finally, while we attempted to test a broad range of phenomena, our list was by no means exhaustive. Future work should aim to examine other aspects of word learning not considered here, such as visual grounding of referents from other kinds of lexical classes such as verbs and adjectives (Ebert & Pavlick, 2020;Nikolaus & Fourtassi, 2021). 6 Although it is common in cross-situational word learning experiments to match the number of words and referents per trial, this form of similarity allows some additional flexibility in handling situations where the number of words differs from the number of referents, like when the set of words is a sentence in natural language and not every word can be mapped onto a visually grounded referent.…”
Section: Discussionmentioning
confidence: 99%
“…Finally, while we attempted to test a broad range of phenomena, our list was by no means exhaustive. Future work should aim to examine other aspects of word learning not considered here, such as visual grounding of referents from other kinds of lexical classes such as verbs and adjectives (Ebert & Pavlick, 2020;Nikolaus & Fourtassi, 2021). 6 Although it is common in cross-situational word learning experiments to match the number of words and referents per trial, this form of similarity allows some additional flexibility in handling situations where the number of words differs from the number of referents, like when the set of words is a sentence in natural language and not every word can be mapped onto a visually grounded referent.…”
Section: Discussionmentioning
confidence: 99%