2022
DOI: 10.1109/tits.2021.3117059
|View full text |Cite
|
Sign up to set email alerts
|

Monocular Depth Estimation Through Virtual-World Supervision and Real-World SfM Self-Supervision

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(4 citation statements)
references
References 50 publications
0
4
0
Order By: Relevance
“…Finally, we compared our method to others (see Table 5) and achieved a lower AbsRel on both KITTI and DDAD1 target domains [35,24] when using real source datatasets that were not specifically tailored to them, by only using a single depth-scaling factor. We also achieved competitive or better accuracy to other depth-scale-transfer methods that used specially tailored synthetic data for KITTI [26,24,35,41] and DDAD [24], and also showed that vKITTI2 can be successfully used as a source to other real datasets such as DDAD. Our accuracy is also competitive to weakly-supervised methods that used target GT velocity [21], GPS [7] or IMU [47] during training, and to online depth scaling methods [36,46] that require road visibility during inference.…”
Section: Transferring the Global Depth Scaling Factor From The Source...mentioning
confidence: 67%
See 1 more Smart Citation
“…Finally, we compared our method to others (see Table 5) and achieved a lower AbsRel on both KITTI and DDAD1 target domains [35,24] when using real source datatasets that were not specifically tailored to them, by only using a single depth-scaling factor. We also achieved competitive or better accuracy to other depth-scale-transfer methods that used specially tailored synthetic data for KITTI [26,24,35,41] and DDAD [24], and also showed that vKITTI2 can be successfully used as a source to other real datasets such as DDAD. Our accuracy is also competitive to weakly-supervised methods that used target GT velocity [21], GPS [7] or IMU [47] during training, and to online depth scaling methods [36,46] that require road visibility during inference.…”
Section: Transferring the Global Depth Scaling Factor From The Source...mentioning
confidence: 67%
“…Since real and synthetic data still differ in style, initial attempts focused on closing these gaps using style transfer from the target to the source domain [2,49,48]. Recent solutions trained on such synthetic and real datasets [26,24,35,41] used mixed-supervision, where the MDE was self-supervised using images from both domains and also fully-supervised using the source GT depth, thus tightly coupled between the depth ranking and scaling. In addition, some solutions used additional intermediate tasks such as semantic segmentation [24] or introduced an additional network to predict the depth-scale [41].…”
Section: Previous Workmentioning
confidence: 99%
“…Yu 9 harryjun@ustc.edu.cn M. Jing 9 jing mohan@mail.ustc.edu.cn X. Qi 9 xiaohua000109@163.com Network. Predictions were obtained as a mixture of multiple networks: DiffNet [107], FeatDepth [74] and MonoDEVS-Net [33]. DiffNet and FeatDepth used a ResNet backbone, while MonoDEVSNet used DenseNet [38].…”
Section: Challenge Submissionsmentioning
confidence: 99%
“…Therefore, it can operate on depth maps Fig. 1: FusionLoc architecture diagram generated using stereo depth estimation techniques [12], [13], monocular depth estimation [14], [15], and infrared cameras such as the ones found in Kinect sensors.…”
Section: Introductionmentioning
confidence: 99%