2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00902
|View full text |Cite
|
Sign up to set email alerts
|

OW-DETR: Open-world Detection Transformer

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
55
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 105 publications
(55 citation statements)
references
References 22 publications
0
55
0
Order By: Relevance
“…We show results for FPN and cascade models trained and evaluated on PACO-LVIS in Tab. 16. Cascade models improve the performance for all but the largest model.…”
Section: E Additional Zero-shot Instance Detection Resultsmentioning
confidence: 95%
See 2 more Smart Citations
“…We show results for FPN and cascade models trained and evaluated on PACO-LVIS in Tab. 16. Cascade models improve the performance for all but the largest model.…”
Section: E Additional Zero-shot Instance Detection Resultsmentioning
confidence: 95%
“…Merge: After the splitting phase the annotated groups are very coherent, i.e., the majority of occurrences in the same group belong to the same instance. However due to video fragmentation and additional limitation on the number of boxes that can be shown to annotators (16) many occurrence groups belong to the same instance and need to be merged. To address this we use similarity in DINO model [2] embedding space.…”
Section: A3 Annotation Pipeline A31 Instance Annotationmentioning
confidence: 99%
See 1 more Smart Citation
“…Pseudo labels play an important role in guiding models to detect unknown object instances, determining the upper learning limitation of the model. The existing methods [13,17] only use model-driven pseudo-labelling and do not take full advantage of the inputs' priori knowledge (light flow, textures, 𝑒𝑡𝑐). The model-driven pseudo-labelling [13] makes the model's learning get caught up in the knowledge of known objects, for the reason that the only source of knowledge for the model is known object instances.…”
Section: Self-adaptive Pseudo-labellingmentioning
confidence: 99%
“…However, its success largely relies on a held-out validation set which is leveraged to estimate the distribution of unknown objects in the energy-based classifier. To alleviate the problems in ORE, OW-DETR [13] proposes to use the detection transformer [3,38] for OWOD in a justifiable way and directly leverages the framework of DDETR [38]. In addition, OW-DETR proposes an attention-driven PLM which selects pseudo labels for unknown objects according to the attention scores.…”
Section: Introductionmentioning
confidence: 99%