2018
DOI: 10.1007/978-3-030-01258-8_30
|View full text |Cite
|
Sign up to set email alerts
|

Visual Question Generation for Class Acquisition of Unknown Objects

Abstract: Traditional image recognition methods only consider objects belonging to already learned classes. However, since training a recognition model with every object class in the world is unfeasible, a way of getting information on unknown objects (i.e., objects whose class has not been learned) is necessary. A way for an image recognition system to learn new classes could be asking a human about objects that are unknown. In this paper, we propose a method for generating questions about unknown objects in an image, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 29 publications
0
8
0
Order By: Relevance
“…In future work, the extracted keywords will be utilized in other tasks, such as VQA, object classification, or object detection. This work could also be combined with recent works on VQG (Uehara et al, 2018;Shen et al, 2019). In these works, the system generates questions to acquire information from humans.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In future work, the extracted keywords will be utilized in other tasks, such as VQA, object classification, or object detection. This work could also be combined with recent works on VQG (Uehara et al, 2018;Shen et al, 2019). In these works, the system generates questions to acquire information from humans.…”
Section: Discussionmentioning
confidence: 99%
“…Thus, to train image recognition models, it is important for real-world intelligent systems to actively acquire information. One promising approach to acquire information on the fly is learning by asking, i.e., generating questions to humans about unknown objects, and consequently learning new knowledge from the human response (Misra et al, 2018;Uehara et al, 2018;Shen et al, 2019). This implies that if we can build a Visual Question Answering (VQA) system (Antol et al, 2015) that functions in the real Figure 1: Example of the proposed task -keyword extraction from full-sentence VQA.…”
Section: Introductionmentioning
confidence: 99%
“…However, questions are limited to templates, and training is done in synthetic environments with a limited set of objects and relationships. [28] uses questions to explore new object classes for image classification. However, [28] does not retrain their classifier.…”
Section: Related Workmentioning
confidence: 99%
“…[28] uses questions to explore new object classes for image classification. However, [28] does not retrain their classifier. Our work differs from [33,28] by proposing a way for the agent to learn in a lifetime setting.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation