2019 International Conference on 3D Vision (3DV) 2019
DOI: 10.1109/3dv.2019.00068
|View full text |Cite
|
Sign up to set email alerts
|

Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation

Abstract: Many applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regions. By contrast, scene coordinate regression (SCoRe) methods generalise to novel poses and can leverage dense correspondences to improve robustness, and recent work has shown how to adapt SCoRe forests between scene… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 36 publications
(26 citation statements)
references
References 71 publications
(288 reference statements)
0
26
0
Order By: Relevance
“…Another recent line of work on single-image localization has focused on machine learning [7,8,10,11,14,15,37,38,57,75,86,88]. Scene coordinate regression approaches [7,8,14,15,57,75,88] train a random forest or convolutional neural network (CNN) to predict the corresponding 3D coordinate for each pixel.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Another recent line of work on single-image localization has focused on machine learning [7,8,10,11,14,15,37,38,57,75,86,88]. Scene coordinate regression approaches [7,8,14,15,57,75,88] train a random forest or convolutional neural network (CNN) to predict the corresponding 3D coordinate for each pixel.…”
Section: Related Workmentioning
confidence: 99%
“…The 2D-3D matches are then used for camera pose estimation, e.g., by applying a PnP solver [1,31,41,43,45,46] inside a robust estimator such as RANSAC [4,9,17,25,47,66]. These visual localization methods typically use either local image descriptors [19,22,34,52] to explicitly match 2D features to 3D scene points or use machine learning, e.g., via a random forest [15,16] or a convolutional neural network (CNN) [6,7,14], to regress the corresponding 3D scene coordinate per pixel. They build a scene representation, e.g., a 3D Structure-from-Motion (SfM) model for local features or a CNN for scene coordinate regression, from a set of reference images.…”
Section: Introductionmentioning
confidence: 99%
“…Active Search [43] and an indoor localization method which exploits dense correspondences [53]. Note that, in general, methods that exploit additional depth information [11,12] Figure 3. Average pose accuracy on the combined scenes.…”
Section: Results On 7-scenes 12-scenes and Cambridgementioning
confidence: 99%
“…Instead of learning the en-tire pipeline, scene coordinate regression methods learn the first stage of the pipeline in the structure-based approaches. Namely, either a random forest [4,12,13,20,30,32,33,50,57] or a neural network [3,5,6,7,9,10,11,27,28,30] is trained to directly predict 3D scene coordinates for the pixels and thus the 2D-3D correspondences are established. These methods do not explicitly rely on feature detection, description and matching, and are able to provide correspondences densely.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation