2016
DOI: 10.1007/s11042-016-4200-x
|View full text |Cite
|
Sign up to set email alerts
|

Depth estimation from single monocular images using deep hybrid network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 26 publications
0
7
0
Order By: Relevance
“…They adopted a long shortterm memory (LSTM) unit to alleviate the overfitting. Grigorev et al utilized VGG-16 as the backbone network, and two-layer RNN to obtain the global context (14,25). Chen et al had a special thought to estimate the depth of a single image by combining the RGB-D data with relative depth data (26).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…They adopted a long shortterm memory (LSTM) unit to alleviate the overfitting. Grigorev et al utilized VGG-16 as the backbone network, and two-layer RNN to obtain the global context (14,25). Chen et al had a special thought to estimate the depth of a single image by combining the RGB-D data with relative depth data (26).…”
Section: Related Workmentioning
confidence: 99%
“…Unfortunately, CRFs are notoriously slow and hard to optimize. Besides, some researchers utilized the recurrent neural network (RNN) to integrate the global features and local features, but they did not design an efficient feature correlation (FCL) network (11)(12)(13)(14)(15).…”
mentioning
confidence: 99%
“…As far as we know, using CNN alone cannot model the longrange context well. Therefore, Grigorev et al [33] proposed a hybrid network by combining convolutional layer and ReNet layer. The ReNet layer consists of Long Short-Term Memory units (LSTMs), so the ReNet layer can obtain a global context feature representation.…”
Section: Deep Learning For Visual Odometry a Deep Learning For Depth ...mentioning
confidence: 99%
“…where M is an × diagonal matrix with th diagonal entry defined in (6), L = Λ −A represents the × Laplacian matrix at th iteration, where A denotes an × affinity matrix with entry of the th row and th column defined in (7), and Λ is an × diagonal matrix with th diagonal entry Λ defined in (8).…”
Section: Solver the Optimization Problem To Minimizementioning
confidence: 99%
“…Automatic method attempts to estimate depth from monoscopic images utilizing various cues such as defocus, texture gradients, and scattering [5]. Recently, deep-learninginspired approaches have been proposed for automatically converting 2D video/image to 3D format [5][6][7][8][9][10]. Although these methods can produce depth maps automatically, they are hard to provide robust and stable conversion results in any general content.…”
Section: Introductionmentioning
confidence: 99%