2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) 2020
DOI: 10.1109/icdl-epirob48136.2020.9278057

Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset

Abstract: Modern social intelligence includes the ability to watch videos and answer questions about social and theory-of-mind-related content, e.g., for a scene in Harry Potter, "Is the father really upset about the boys flying the car?" Social visual question answering (social VQA) is emerging as a valuable methodology for studying social reasoning in both humans (e.g., children with autism) and AI agents. However, this problem space spans enormous variations in both videos and questions. We discuss methods for creatin…

Cited by 1 publication (1 citation statement)
References 19 publications
“…Social-IQ [13] is an unconstrained benchmark that introduces the task of Social Video Question Answering. It consists of human-centered videos in the wild along with social and theory-of-mind-related questions, and answering can demand sophisticated combinations of language understanding, cultural knowledge, logical and causal reasoning, on top of nonsocial layers of comprehension about physical events [14].…”
Section: Introduction (mentioning)
confidence: 99%