2017 IEEE Intelligent Vehicles Symposium (IV) 2017
DOI: 10.1109/ivs.2017.7995968
|View full text |Cite
|
Sign up to set email alerts
|

RegNet: Multimodal sensor registration using deep neural networks

Abstract: Abstract-In this paper, we present RegNet, the first deep convolutional neural network (CNN) to infer a 6 degrees of freedom (DOF) extrinsic calibration between multimodal sensors, exemplified using a scanning LiDAR and a monocular camera. Compared to existing approaches, RegNet casts all three conventional calibration steps (feature extraction, feature matching and global regression) into a single real-time capable CNN. Our method does not require any human interaction and bridges the gap between classical of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
117
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 173 publications
(119 citation statements)
references
References 21 publications
1
117
0
1
Order By: Relevance
“…V-A3). Interestingly, several works propose to calibrate sensors by deep neural networks: Giering et al [197] discretize the spatial misalignments between LiDAR and visual camera into nine classes, and build a network to classify misalignment taking LiDAR and RGB images as inputs; Schneider et al [198] propose to fully regress the extrinsic calibration parameters between LiDAR and visual camera by deep learning. Several multi-modal CNN networks are trained on different decalibration ranges to iteratively refine the calibration output.…”
Section: ) Data Quality and Alignmentmentioning
confidence: 99%
“…V-A3). Interestingly, several works propose to calibrate sensors by deep neural networks: Giering et al [197] discretize the spatial misalignments between LiDAR and visual camera into nine classes, and build a network to classify misalignment taking LiDAR and RGB images as inputs; Schneider et al [198] propose to fully regress the extrinsic calibration parameters between LiDAR and visual camera by deep learning. Several multi-modal CNN networks are trained on different decalibration ranges to iteratively refine the calibration output.…”
Section: ) Data Quality and Alignmentmentioning
confidence: 99%
“…The work presented in this paper has been inspired by Schneider et al [26], which used 3D scans from a LiDAR and RGB images as the input of a novel CNN, RegNet. Their goal was to provide a CNN-based method for calibrating the extrinsic parameters of a camera w.r.t.…”
Section: B Camera and Lidar-map Approachesmentioning
confidence: 99%
“…Yet, surprisingly, only a few deep learning based approaches have been applied to the calibration problem. The first deep [14] regress to the calibration parameters, conditioned on the input image. Since it doesn't take geometry into account, it has to be retrained each time sensor intrinsics change.…”
Section: Related Workmentioning
confidence: 99%
“…For the depth branch, we use a similar architecture as for the RGB stream, but with half the number of filters at each stage. Like in [14], this architecture has several advantages for feature extraction. The use of pre-trained weights for the RGB input prevents learning the relevant features from scratch.…”
Section: (B)mentioning
confidence: 99%