2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01085
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Detect Scene Landmarks for Camera Localization

Abstract: Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-specific 3D points or landmarks and computing camera pose from the associated 2D-3D correspondences. Although SLD outp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 21 publications
(4 citation statements)
references
References 78 publications
0
4
0
Order By: Relevance
“…Approaches based on implicit representations map image pixels or patches to 3D points by training scene coordinate regression models [3,38]. Recently, it was claimed that such approaches are inherently privacy-preserving [11]. However, feature-based methods currently scale better to large scenes and are able to better handle condition changes [44], such as illumination or seasonal changes, between the query image and the database images used to build the the scene representation.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Approaches based on implicit representations map image pixels or patches to 3D points by training scene coordinate regression models [3,38]. Recently, it was claimed that such approaches are inherently privacy-preserving [11]. However, feature-based methods currently scale better to large scenes and are able to better handle condition changes [44], such as illumination or seasonal changes, between the query image and the database images used to build the the scene representation.…”
Section: Related Workmentioning
confidence: 99%
“…Work on privacy-preserving queries to a localization server thus mostly aims at developing features that prevent image recovery [14,26] or on obfuscating the feature geometry [16,42]. Similarly, work on privacy-preserving scene representation aims to obfuscate the geometry [37,41] (although scene geometry can be recovered under certain conditions [7]), splitting the maps over multiple server for increased data security [15], using implicit representations [11], or storing raw geometry without any feature descriptors [52].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Generally, algorithms for visual localisation mostly rely on complex 3D point clouds that are expensive to build, store, and maintain over time. Do et al (2022a) trained a CNN to detect the appearance of a sparse set of 3D scene points (scene landmarks), and showed that those predicted landmarks can yield accurate pose estimates, while being privacy preserving and requiring low data storage. Panek et al (2022) explored dense 3D scene models as an alternative to the sparse Structure-from-Motion point clouds as they are more flexible than SfM-based representations and can be rather compact.…”
Section: Visual Localisationmentioning
confidence: 99%