Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413893
|View full text |Cite
|
Sign up to set email alerts
|

Dual Adversarial Network for Unsupervised Ground/Satellite-to-Aerial Scene Adaptation

Abstract: Columbia(a) Harbor (b) Forest (c) Residence (d) Beach (e) Parking Lot Collected from Satellite Collected from UAV Collected from Land Ground View Aerial View Satellite View Figure 1: Examples of scenes from top-down views. From top to down are scenes from the satellite view, the aerial view, and the ground view. Scenes from the satellite view are with much lower resolution and clarity compared with the aerial view. Scenes from the ground view and the aerial view are with huge domain gap even with the consisten… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 57 publications
(40 reference statements)
0
7
0
Order By: Relevance
“…The satellite/aerial to ground matching at the scene level is highly challenging due to the large semantic gap between the ground and overhead scene, but still resolvable. For example, the authors in [39] proposed an dual adversarial solution for unsupervised satellite/aerial to ground scene adaptation solution. However, it becomes very crucial when object level matching is concerned, e.g., when approaching automatic geolocalization [40], [41], and in particular when considering image synthesis [42]- [44], where strong geometric models of the various modalities, with the ability to model uncertainty, are strongly needed.…”
Section: Multimodal Approaches With Social Mediamentioning
confidence: 99%
“…The satellite/aerial to ground matching at the scene level is highly challenging due to the large semantic gap between the ground and overhead scene, but still resolvable. For example, the authors in [39] proposed an dual adversarial solution for unsupervised satellite/aerial to ground scene adaptation solution. However, it becomes very crucial when object level matching is concerned, e.g., when approaching automatic geolocalization [40], [41], and in particular when considering image synthesis [42]- [44], where strong geometric models of the various modalities, with the ability to model uncertainty, are strongly needed.…”
Section: Multimodal Approaches With Social Mediamentioning
confidence: 99%
“…In recent years, many efforts ( Zhu et al, 2017 ), e.g., developing novel network architectures ( Murray et al, 2019 , Cheng et al, 2020 , Bi et al, 2020 , Niazmardi et al, 2017 , Lin et al, 2020 , Zhu et al, 2018 ) and pipelines ( Byju et al, 2000 , Xu et al, 2020 , Wang et al, 2019 , Zhu et al, 2019 ), publishing large-scale datasets ( Xia et al, 2017 , Jin et al, 2018 ), introducing multi-modal and multi-temporal data ( Hu et al, 2020 , Tuia et al, 2016 , Ru et al, 2020 , Li et al, 2020a ), have been deployed to address this task, and most of them treat it as a single-label classification problem. A common assumption shared by these researches is that an aerial image belongs to only one scene category, while in real-world scenarios, it is more often that there exist various scenes in a single image (cf.…”
Section: Introductionmentioning
confidence: 99%
“…Besides, it assigns static weights to neighbours, which results in the inability to adapt to different inference relations. We argue that the accuracy of the inference results is not only affected by the explicit characteristics of the entity itself, but also by the implicit characteristics [9, 10]. For example, the implicit characteristics of the entity's neighbours can cascade to affect the overall characteristics of the entities [11, 12].…”
Section: Introductionmentioning
confidence: 99%