2018
DOI: 10.48550/arxiv.1811.08982
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Polarity Loss for Zero-shot Object Detection

Shafin Rahman,
Salman Khan,
Nick Barnes

Abstract: Zero-shot object detection is an emerging research topic that aims to recognize and localize previously 'unseen' objects. This setting gives rise to several unique challenges, e.g., highly imbalanced positive vs. negative instance ratio, ambiguity between background and unseen classes and the proper alignment between visual and semantic concepts. Here, we propose an end-to-end deep learning framework underpinned by a novel loss function that seeks to properly align the visual and semantic cues for improved zer… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
62
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(62 citation statements)
references
References 46 publications
0
62
0
Order By: Relevance
“…Region proposal-based approaches instead first generate region proposals, then classify (and update) each region proposal [3], [4], [6], [7], [10], [55], [56]. Recently, zero-shot object detection methods have been proposed [23], [24], [25], [26], [27], [57], [58], typically as extensions of supervised object detection models. Among these studies, Bansal et al [25] proposes a two-step approach which first locates object proposals from low-level features [59] and then classifies the resulting candidate regions using a ZSL model.…”
Section: Zero-shot Object Detectionmentioning
confidence: 99%
See 2 more Smart Citations
“…Region proposal-based approaches instead first generate region proposals, then classify (and update) each region proposal [3], [4], [6], [7], [10], [55], [56]. Recently, zero-shot object detection methods have been proposed [23], [24], [25], [26], [27], [57], [58], typically as extensions of supervised object detection models. Among these studies, Bansal et al [25] proposes a two-step approach which first locates object proposals from low-level features [59] and then classifies the resulting candidate regions using a ZSL model.…”
Section: Zero-shot Object Detectionmentioning
confidence: 99%
“…Demirel et al [23] proposes a regressionbased ZSD model that jointly incorporates two ZSL models based on convex combinations of semantic embeddings [60] and bi-linear compatibility models [16]. Rahman et al [26] proposes a polarity loss term that is based on the focal loss approach, to tackle better alignment between visual and semantic domains. Hence, semantic representations of visually similar classes get closer to each other.…”
Section: Zero-shot Object Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…In terms of the MS COCO dataset, we follow the same procedures described in [11] to divide the dataset into two different splits: (1) 48 seen and 17 unseen classes; (2) 65 seen and 15 unseen classes. Conditioned on the above seen/unseen class splits, we follow the steps in [62] to create the train and test set for each dataset. Implementation Details.…”
Section: Training and Inference Detailsmentioning
confidence: 99%
“…TD [14] learns both visual-unit-level and word-level attention to tackle the ZSD task with textual descriptions instead of a single word. PL [62] designs a novel polarity loss for RetinaNet based ZSD framework to better align visual and semantic concepts. BLC [56] combines Cascade Semantic R-CNN, semantic information flow and background learnable RPN into a unified framework for the ZSD task.…”
Section: Training and Inference Detailsmentioning
confidence: 99%