2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020
DOI: 10.1109/cvprw50498.2020.00027

Extending Absolute Pose Regression to Multiple Scenes

Cited by 23 publications
(12 citation statements)
References 17 publications
“…Tables 1 and 2 show the results obtained with our method (MS-Transformer) and with MSPN on the CambridgeLandmarks and the 7Scenes datasets, respectively. Since MSPN was trained on different scene combinations from the CambridgeLandmarks dataset, we take the best performing model reported by the authors on this dataset [3]. Our method consistently outperforms MSPN across outdoor and indoor scenes, reducing both position and orientation errors.…”
Section: Comparative Analysis Of APRs (mentioning)
confidence: 99%
“…However, similar to APRs, a model needs to be trained per scene. In addition, these methods are challenging to implement, require a long time to converge and are slower (100ms) by an order of magnitude compared to absolute pose regression approaches (10ms) at inference time [3]. They also suffer from a non-deterministic behavior due to the inherent randomness of RANSAC.…”
Section: Related Work (mentioning)
confidence: 99%
“…APRs are typically trained per scene, encoding images with a convolutional backbone and then regressing the camera pose parameters with a multi-layer perceptron (MLP) [25,23,24,28,29,48,36]. This scheme was recently extended to learn multiple scenes with a single model using Transformers [38] or by indexing scene-specific weights [5]. Pose encoding was also proposed as a means for introducing scene priors and improving performance [39].…”
Section: Visual Localization (mentioning)
confidence: 99%
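
The first citation statement compares methods by their position and orientation errors. As a rough illustration of how such errors are commonly computed for pose regressors, the sketch below measures position error as the Euclidean distance between translations and orientation error as the angle between unit quaternions. The function names and the (w, x, y, z) quaternion layout are assumptions made for this example; they are not taken from the cited papers, which may use a different evaluation protocol.

```python
import math


def position_error(t_pred, t_true):
    """Euclidean distance between predicted and true camera positions.

    The result is in the same unit as the inputs (e.g. meters).
    """
    return math.dist(t_pred, t_true)


def orientation_error_deg(q_pred, q_true):
    """Angle in degrees between two orientations given as quaternions.

    Quaternions are assumed to be (w, x, y, z); they are normalized here,
    so the inputs do not need to be unit length.
    """
    def unit(q):
        norm = math.sqrt(sum(c * c for c in q))
        return [c / norm for c in q]

    qp, qt = unit(q_pred), unit(q_true)
    # abs() handles the q / -q ambiguity: both encode the same rotation.
    dot = abs(sum(a * b for a, b in zip(qp, qt)))
    # Clamp to guard against rounding slightly above 1.0.
    dot = min(1.0, dot)
    return math.degrees(2.0 * math.acos(dot))


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    print(position_error([0.0, 0.0, 0.0], [3.0, 4.0, 0.0]))
    print(orientation_error_deg([1.0, 0.0, 0.0, 0.0], [1.0, 0.0, 0.0, 0.0]))
```

Benchmark tables for such methods often summarize these per-image values with a median per scene, but that aggregation step is not shown here.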