2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00389
|View full text |Cite
|
Sign up to set email alerts
|

Neural Window Fully-connected CRFs for Monocular Depth Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
81
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 196 publications
(82 citation statements)
references
References 25 publications
1
81
0
Order By: Relevance
“…NeWCRFs [34] Ours GT Figure 1: We observe that depth boundaries in state-of-theart [34] align well with object boundaries, but the depth label is often incorrect. Note the confusion for the middle pillow in the first row and the bed in the second row.…”
Section: Inputmentioning
confidence: 91%
See 3 more Smart Citations
“…NeWCRFs [34] Ours GT Figure 1: We observe that depth boundaries in state-of-theart [34] align well with object boundaries, but the depth label is often incorrect. Note the confusion for the middle pillow in the first row and the bed in the second row.…”
Section: Inputmentioning
confidence: 91%
“…The feature pyramidal-based decoder mitigates the issue by fusing low-resolution, semantically rich decoder features with the higher resolution but semantically weaker encoder features via a top-down path-way and lateral connections called skip connections [18]. Inline with the recent success of transformers, many latest works have used a self-attention based architectures for MDE [1,32,34]. Self-attention increases the receptive field and allows to capture long-range dependencies in feature maps.…”
Section: Inputmentioning
confidence: 99%
See 2 more Smart Citations
“…Recently, models using a classification model to boost the performance of depth estimation, such as binsformer [25] or adpative bins [26], perform well with the monocular depth estimation. However, Neural window FC-CRFs (NeWCRF) reaches the same performance by applying Conditional Random Field (CRF) on the decoder part to regress the depth map by utilizing fully-connected CRFs on each split image part (window) [27]. Therefore, we chose the Swin Transformer with NeWCRF decoder among the state of the art models for monocular depth estimation.…”
Section: A Algorithms For Shape Reconstructionmentioning
confidence: 99%