2020
DOI: 10.48550/arxiv.2010.04502
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Background Learnable Cascade for Zero-Shot Object Detection

Ye Zheng,
Ruoran Huang,
Chuanqi Han
et al.

Abstract: Zero-shot detection (ZSD) is crucial to large-scale object detection with the aim of simultaneously localizing and recognizing unseen objects. There remain several challenges for ZSD, including reducing the ambiguity between background and unseen objects as well as improving the alignment between visual and semantic concept. In this work, we propose a novel framework named Background Learnable Cascade (BLC) to improve ZSD performance. The major contributions for BLC are as follows: (i) we propose a multi-stage… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(7 citation statements)
references
References 46 publications
0
7
0
Order By: Relevance
“…We compare our method with the state-of-the-art zeroshot detection approaches on two split benchmarks in Table 2 over ZSD setting. We can observe that: (i) for 48/17 split, we compare our approaches with SB [23], DSES [23], TD [29], PL [26], Gtnet [30], DELO [31] and BLC [32]. Our method surpasses all of them, brings up to 36.99% and 11.08% gain in terms of Recall@100 and mAP; (ii) for 65/15 split, compared with PL [26] and BLC [32], our method still gets the best performance and brings 21.18% gain for Recall@100 and 1.2% improvement for mAP.…”
Section: Comparison With Other Zsd Methodsmentioning
confidence: 98%
See 4 more Smart Citations
“…We compare our method with the state-of-the-art zeroshot detection approaches on two split benchmarks in Table 2 over ZSD setting. We can observe that: (i) for 48/17 split, we compare our approaches with SB [23], DSES [23], TD [29], PL [26], Gtnet [30], DELO [31] and BLC [32]. Our method surpasses all of them, brings up to 36.99% and 11.08% gain in terms of Recall@100 and mAP; (ii) for 65/15 split, compared with PL [26] and BLC [32], our method still gets the best performance and brings 21.18% gain for Recall@100 and 1.2% improvement for mAP.…”
Section: Comparison With Other Zsd Methodsmentioning
confidence: 98%
“…Zhu et al [31] propose DELO that synthesizes visual features for unseen objects from semantic information and incorporate the detection for seen and unseen objects. Zheng et al [32] boost the ZSD performance with a cascade structure to progressively refine the visual-semantic mapping relationship and propose BLRPN to learn the word-vector for background class. In terms of the background representation, SB [23] and PL [26] use the fixed word-vector for "background" word, DSES [23] and BLC [32] try to learn a more reasonable representation.…”
Section: Zero-shot Learningmentioning
confidence: 99%
See 3 more Smart Citations