2021
DOI: 10.1109/tcsvt.2021.3049869
|View full text |Cite
|
Sign up to set email alerts
|

Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
60
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 175 publications
(62 citation statements)
references
References 36 publications
2
60
0
Order By: Relevance
“…Lee et al [37] demonstrated that by using DenseNet161 [18] as the encoder backbone for the NYUv2 [4] dataset, their method's accuracy was higher than when using ResNet101 [17]. Song et al [41] further demonstrated in their ablation studies that for the KITTI [5] dataset, the ResNeXt [36] encoder provides the best performance for their model, which matched the findings by Lee et al [37] for this dataset. Ablation studies from Bhat et al [21] illustrate that the use of the EfficientNet-B5 [1] can produce very good predictive performance with a basic decoder.…”
Section: ) Encoders For Monocular Depth Estimationsupporting
confidence: 55%
See 2 more Smart Citations
“…Lee et al [37] demonstrated that by using DenseNet161 [18] as the encoder backbone for the NYUv2 [4] dataset, their method's accuracy was higher than when using ResNet101 [17]. Song et al [41] further demonstrated in their ablation studies that for the KITTI [5] dataset, the ResNeXt [36] encoder provides the best performance for their model, which matched the findings by Lee et al [37] for this dataset. Ablation studies from Bhat et al [21] illustrate that the use of the EfficientNet-B5 [1] can produce very good predictive performance with a basic decoder.…”
Section: ) Encoders For Monocular Depth Estimationsupporting
confidence: 55%
“…On top of their dilated ResNet-101 backbone in Stage 1, they use the ASPP module [20] to gather global contextual information in Stage 2. ASPP modules in different forms have since been adopted by Yin et al [35], Lee et al [37] and Song et al [41].…”
Section: ) Dilated Convolutions In Depth Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…They employed a reinforcement learning algorithm and automatically prune redundant channels of MDE by finding a relatively optimal pruning policy. Song et al [28] et al proposed a simple but effective scheme by incorporating the Laplacian pyramid into the decoder architecture. Specifically, encoded features were fed into different streams for decoding depth residuals.…”
Section: Related Workmentioning
confidence: 99%
“…The first works in this area [14], [15] used ground truth depth to learn supervisedly. Later research contributed mainly by proposing architectural innovations [16]- [19]. All these methods rely on accurate ground truth labels at training, which is not trivial in many application domains.…”
Section: B Single-view Depth Learningmentioning
confidence: 99%