2019
DOI: 10.1177/1729881419893518
|View full text |Cite
|
Sign up to set email alerts
|

Convolutional neural network-based coarse initial position estimation of a monocular camera in large-scale 3D light detection and ranging maps

Abstract: Initial position estimation in global maps, which is a prerequisite for accurate localization, plays a critical role in mobile robot navigation tasks. Global positioning system signals often become unreliable in disaster sites or indoor areas, which require other localization methods to help the robot in searching and rescuing. Many visual-based approaches focus on estimating a robot's position within prior maps acquired with cameras. In contrast to conventional methods that need a coarse estimation of initial… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 34 publications
0
3
0
Order By: Relevance
“…This method learns the cross-domain features through a weighted soft-margin triplet loss, while ignoring the underlying geometry connections between the two different domains. Sun et.al [11] introduce an image-to-range coarse localization method by building the feature connections within depth images, where the depth is estimated from a depth prediction network.…”
Section: Related Workmentioning
confidence: 99%
“…This method learns the cross-domain features through a weighted soft-margin triplet loss, while ignoring the underlying geometry connections between the two different domains. Sun et.al [11] introduce an image-to-range coarse localization method by building the feature connections within depth images, where the depth is estimated from a depth prediction network.…”
Section: Related Workmentioning
confidence: 99%
“…In the precision-recall curve evaluation index, precision ¼ TP/(TPþFP) indicates the probability that the prediction is correct in the positive sample, and recall ¼ TP/(TPþFN) indicates the probability that the prediction is correct in the actual positive sample. 18 The more the precision-recall curve of a classifier in the plane rectangular coordinate system is convex to the upper right, indicating that the performance of this classifier is better. In addition, since the precision-recall curve is affected by the test sample, there will be a large fluctuation of the curve, so the AveP is generally used to measure the performance of the classifier.…”
Section: Experiments Processmentioning
confidence: 99%
“…Compared with traditional LDI, multiview LDI (MVLDI) creates fewer layers and eliminates more redundancy. In references [35,36], the author proposes a novel method that uses a convolutional neural network to enable a mobile robot to estimate its rough position in a 3D map using only a monocular camera. e article uses a pretrained convolutional neural network model to generate a depth image descriptor, and retrieves the location by calculating the similarity score between the current depth image and the depth image projected from the 3D map.…”
Section: Introductionmentioning
confidence: 99%