2023
DOI: 10.48550/arxiv.2301.05499
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CLIP the Gap: A Single Domain Generalization Approach for Object Detection

Abstract: Single Domain Generalization (SDG) tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain. While this has been well studied for image classification, the literature on SDG object detection remains almost non-existent. To address the challenges of simultaneously learning robust object localization and representation, we propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts. We achieve this v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 42 publications
0
1
0
Order By: Relevance
“…However, the model does not diversify the singledomain with data augmentation, which potentially can be improved. Vidit et al (2023) utilized a pre-trained visionlanguage model to generalize a detector, but textual hints about the target domains should be given. Our proposed method does not require prior knowledge about target domains and achieves S-DGOD in an object-aware approach.…”
Section: Single-domain Generalization For Object Detectionmentioning
confidence: 99%
“…However, the model does not diversify the singledomain with data augmentation, which potentially can be improved. Vidit et al (2023) utilized a pre-trained visionlanguage model to generalize a detector, but textual hints about the target domains should be given. Our proposed method does not require prior knowledge about target domains and achieves S-DGOD in an object-aware approach.…”
Section: Single-domain Generalization For Object Detectionmentioning
confidence: 99%