2019
DOI: 10.1609/aaai.v33i01.33018690
|View full text |Cite
|
Sign up to set email alerts
|

Zero-Shot Object Detection with Textual Descriptions

Abstract: Object detection is important in real-world applications. Existing methods mainly focus on object detection with sufficient labelled training data or zero-shot object detection with only concept names. In this paper, we address the challenging problem of zero-shot object detection with natural language description, which aims to simultaneously detect and recognize novel concept instances with textual descriptions. We propose a novel deep learning framework to jointly learn visual units, visual-unit attention a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
98
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 84 publications
(98 citation statements)
references
References 24 publications
0
98
0
Order By: Relevance
“…The study (Li et al 2019b) extends other the other two ZSD methods Bansal et al 2018) using the textual description embedding and report the results. We just copy and paste the results from the paper (Li et al 2019b) in Table 1 and Table 2.…”
Section: Dataset Descriptionmentioning
confidence: 91%
See 4 more Smart Citations
“…The study (Li et al 2019b) extends other the other two ZSD methods Bansal et al 2018) using the textual description embedding and report the results. We just copy and paste the results from the paper (Li et al 2019b) in Table 1 and Table 2.…”
Section: Dataset Descriptionmentioning
confidence: 91%
“…All the modules are trained using the Adam optimizer with a 10 −4 learning rate. Compared Methods: We compare the effect of our proposed approach to three recent state-of-the-art methods Bansal et al 2018;Li et al 2019b). In order to fairly compare with the method (Li et al 2019b), we use the textual description embedding which is trained by fastText (Edouard Grave and Bojanowski 2017) as the class semantic embedding.…”
Section: Dataset Descriptionmentioning
confidence: 99%
See 3 more Smart Citations