Self-supervised learning for using overhead imagery as maps in outdoor range sensor localization

Tang, Tim Y.; Martini, Daniele De; Wu, Shangzhe; Newman, Paul

doi:10.1177/02783649211045736

Cited by 26 publications

(19 citation statements)

References 48 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Missing cells indicate scenes that are not evaluated or not tracked successfully. Tang et al [14], [15], [16] show results for x and y dimensions separately -we report x 2 + y 2 . For each method, the best reported results are shown.…”

Section: Discussionmentioning

confidence: 70%

“…Previous methods based on end-to-end trainable models [14], [15], [37], [16] utilize only input from range-scanners (i.e. radar and lidar) on the ground vehicle.…”

Section: End-to-end Learnable Featuresmentioning

confidence: 99%

“…radar and lidar) on the ground vehicle. The works by Tang et al [14], [15], [16] also discard lidar points on the ground plane and thereby focus entirely on vertical structures.…”

Section: End-to-end Learnable Featuresmentioning

confidence: 99%

“…• Several methods [13], [14], [15], [16] train and test on data from the same city area. Others [17], [18] assume that the ground region or aerial data to be tested on has already been seen during the training stage.…”

Section: Introductionmentioning

confidence: 99%

“…• Some methods rely on specific environment features such as edges of buildings [19], [20], [14], [15], [16] or semantic features [13], [21]. This limits the applicability to locations where these features are present and visible, i.e.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Continuous Self-Localization on Aerial Images Using Visual and Lidar Sensors

Fervers¹,

Bullinger²,

Bodensteiner³

et al. 2022

Preprint

View full text Add to dashboard Cite

This paper proposes a novel method for geo-tracking, i.e. continuous metric self-localization in outdoor environments by registering a vehicle's sensor information with aerial imagery of an unseen target region. Geo-tracking methods offer the potential to supplant noisy signals from global navigation satellite systems (GNSS) and expensive and hard to maintain prior maps that are typically used for this purpose. The proposed geo-tracking method aligns data from on-board cameras and lidar sensors with geo-registered orthophotos to continuously localize a vehicle. We train a model in a metric learning setting to extract visual features from ground and aerial images. The ground features are projected into a top-down perspective via the lidar points and are matched with the aerial features to determine the relative pose between vehicle and orthophoto.Our method is the first to utilize on-board cameras in an end-to-end differentiable model for metric self-localization on unseen orthophotos. It exhibits strong generalization, is robust to changes in the environment and requires only geo-poses as ground truth. We evaluate our approach on the KITTI-360 dataset and achieve a mean absolute position error (APE) of 0.94m. We further compare with previous approaches on the KITTI odometry dataset and achieve state-of-the-art results on the geo-tracking task.

show abstract

Section: Discussionmentioning

confidence: 70%

“…Previous methods based on end-to-end trainable models [14], [15], [37], [16] utilize only input from range-scanners (i.e. radar and lidar) on the ground vehicle.…”

Section: End-to-end Learnable Featuresmentioning

confidence: 99%

“…radar and lidar) on the ground vehicle. The works by Tang et al [14], [15], [16] also discard lidar points on the ground plane and thereby focus entirely on vertical structures.…”

Section: End-to-end Learnable Featuresmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Continuous Self-Localization on Aerial Images Using Visual and Lidar Sensors

Fervers¹,

Bullinger²,

Bodensteiner³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

Visual Cross-View Metric Localization with Dense Uncertainty Estimates

Xia

Booij

Manfredi

et al. 2022

Lecture Notes in Computer Science

View full text Add to dashboard Cite

This work addresses visual cross-view metric localization for outdoor robotics. Given a ground-level color image and a satellite patch that contains the local surroundings, the task is to identify the location of the ground camera within the satellite patch. Related work addressed this task for range-sensors (LiDAR, Radar), but for vision, only as a secondary regression step after an initial cross-view image retrieval step. Since the local satellite patch could also be retrieved through any rough localization prior (e.g. from GPS/GNSS, temporal filtering), we drop the image retrieval objective and focus on the metric localization only. We devise a novel network architecture with denser satellite descriptors, similarity matching at the bottleneck (rather than at the output as in image retrieval), and a dense spatial distribution as output to capture multimodal localization ambiguities. We compare against a state-of-the-art regression baseline that uses global image descriptors. Quantitative and qualitative experimental results on the recently proposed VIGOR and the Oxford RobotCar datasets validate our design. The produced probabilities are correlated with localization accuracy, and can even be used to roughly estimate the ground camera's heading when its orientation is unknown. Overall, our method reduces the median metric localization error by 51%, 37%, and 28% compared to the state-of-the-art when generalizing respectively in the same area, across areas, and across time.

show abstract