2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00527
|View full text |Cite
|
Sign up to set email alerts
|

Leveraging Self-Supervision for Cross-Domain Crowd Counting

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(11 citation statements)
references
References 64 publications
0
11
0
Order By: Relevance
“…These methods can either explicitly align the feature distributions using a specific distance metric [34,36,53], or implicitly align the distributions using an adversarial loss [8,13,35] or GAN [21,41]. While most current works in domain adaptation focus on image classification [1,2,11,15,24,29,30,37,38,39,46,54], a few have delved into object detection [6,10,16,23,40,45]. Data mixing [64,66] is appealing in the context of UDA because of the opportunity to strategically blend cross-domain information during training.…”
Section: Cross-domain Object Detectionmentioning
confidence: 99%
“…These methods can either explicitly align the feature distributions using a specific distance metric [34,36,53], or implicitly align the distributions using an adversarial loss [8,13,35] or GAN [21,41]. While most current works in domain adaptation focus on image classification [1,2,11,15,24,29,30,37,38,39,46,54], a few have delved into object detection [6,10,16,23,40,45]. Data mixing [64,66] is appealing in the context of UDA because of the opportunity to strategically blend cross-domain information during training.…”
Section: Cross-domain Object Detectionmentioning
confidence: 99%
“…Their approach to the problem of synthetic domain adaptation involved using SE Cycle GAN, a method typically used for style transfer in images, to transform the synthetic images such that they appear in the style of the target domain. Liu et al [103] approach the problem by jointly training a neural network on the synthetic data in a fully-supervised fashion and on the real data in a self-supervised fashion. Their network attempts to solve a proxy task, which involves predicting whether or not an image from the real distribution has been flipped upside down.…”
Section: Synthetic Datamentioning
confidence: 99%
“…GCC only tackles crowd counting problems, likely due to simple 3D assets for humans being highly accessible. Further, there is still a large domain gap between these synthetic images and real images, which has prevented performance gains [103]. Generating high-quality and realistic synthetic images without relying on 3D assets would allow the synthetic counting problem to easily extend to other object classes, and would significantly reduce the annotation burden.…”
Section: Synthetic Datamentioning
confidence: 99%
“…The former method [28] is mainly based on CycleGAN [29], which first translates the synthesised labelled data into images with similar styles to the target domain and then trains the translated images in the crowd counting task. The latter method [30][31][32] utilises the adversarial learning strategy of GANs, where a discriminator distinguishes the density maps of the source and target domains produced by a generator to align the data distribution. In this paper, we combine the above two strategies.…”
Section: Domain Adaptationmentioning
confidence: 99%