2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00266
|View full text |Cite
|
Sign up to set email alerts
|

Stochastic Attraction-Repulsion Embedding for Large Scale Image Localization

Abstract: This paper tackles the problem of large-scale imagebased localization (IBL) where the spatial location of a query image is determined by finding out the most similar reference images in a large database. For solving this problem, a critical task is to learn discriminative image representation that captures informative information relevant for localization. We propose a novel representation learning method having higher location-discriminating power.It provides the following contributions: 1) we represent a pla… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
124
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 100 publications
(124 citation statements)
references
References 39 publications
0
124
0
Order By: Relevance
“…Given embeddings f g and f s of the ground-view panorama and satellite image, respectively, the aim of crossview metric learning is to embed the cross-view embeddings to a same space, with metric distances (L 2 -metric) between embeddings reflect the similarity/dissimilarity between cross-view images. There are many metric learning objective functions available, e.g., triplet ranking [5], SARE [19], contrastive [24], angular [32] losses. All losses try to pull the L 2 distances between matchable cross-view embeddings, while pushing the L 2 distances among nonmatchable cross-view embeddings.…”
Section: Triplet Loss For Cross-view Metric Learningmentioning
confidence: 99%
“…Given embeddings f g and f s of the ground-view panorama and satellite image, respectively, the aim of crossview metric learning is to embed the cross-view embeddings to a same space, with metric distances (L 2 -metric) between embeddings reflect the similarity/dissimilarity between cross-view images. There are many metric learning objective functions available, e.g., triplet ranking [5], SARE [19], contrastive [24], angular [32] losses. All losses try to pull the L 2 distances between matchable cross-view embeddings, while pushing the L 2 distances among nonmatchable cross-view embeddings.…”
Section: Triplet Loss For Cross-view Metric Learningmentioning
confidence: 99%
“…The contrastive loss, pulling the distance between positive pairs, could further improve the geo-localization results [16,35]. Recently, Liu et al propose Stochastic Attraction and Repulsion Embedding (SARE) loss, minimizing the KL divergence between the learned and the actual distributions [19]. Another line of works focuses on the spatial misalignment problem in the ground-to-aerial matching.…”
Section: Deeply-learned Feature For Geo-localizationmentioning
confidence: 99%
“…The final layer is the softmax layer. More details about the VGG16 architecture can be found in [14,24,25].…”
Section: Related Workmentioning
confidence: 99%