2022
DOI: 10.48550/arxiv.2210.09071
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Attention Attention Everywhere: Monocular Depth Prediction with Skip Attention

Abstract: Monocular Depth Estimation (MDE) aims to predict pixel-wise depth given a single RGB image. For both, the convolutional as well as the recent attention-based models, encoder-decoder-based architectures have been found to be useful due to the simultaneous requirement of global context and pixel-level resolution. Typically, a skip connection module is used to fuse the encoder and decoder features, which comprises of feature map concatenation followed by a convolution operation. Inspired by the demonstrated benef… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 25 publications
0
1
0
Order By: Relevance
“…Eigen et al [16,17] proposed a multi-stage, coarseto-fine network to estimate depth from a single image, and Laina et al [35] a fully convolutional architecture for depth estimation. Some works introduce attention mechanisms to achieve significant performance improvements [1,2,37,39]. Other works estimate depth by predicting probability distributions and discrete bins [5,40,59].…”
Section: Depth Predictionmentioning
confidence: 99%
“…Eigen et al [16,17] proposed a multi-stage, coarseto-fine network to estimate depth from a single image, and Laina et al [35] a fully convolutional architecture for depth estimation. Some works introduce attention mechanisms to achieve significant performance improvements [1,2,37,39]. Other works estimate depth by predicting probability distributions and discrete bins [5,40,59].…”
Section: Depth Predictionmentioning
confidence: 99%