2019
DOI: 10.3390/s19081795
|View full text |Cite
|
Sign up to set email alerts
|

Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network

Abstract: Semantic segmentation and depth estimation are two important tasks in computer vision, and many methods have been developed to tackle them. Commonly these two tasks are addressed independently, but recently the idea of merging these two problems into a sole framework has been studied under the assumption that integrating two highly correlated tasks may benefit each other to improve the estimation accuracy. In this paper, depth estimation and semantic segmentation are jointly addressed using a single RGB input … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
31
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(31 citation statements)
references
References 34 publications
(82 reference statements)
0
31
0
Order By: Relevance
“…The thresholded accuracy means the ratio of the maximum relative error below the threshold . In our experiments, we use to align the setting with previous studies [ 6 , 8 , 13 , 15 , 16 , 17 , 42 ].…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…The thresholded accuracy means the ratio of the maximum relative error below the threshold . In our experiments, we use to align the setting with previous studies [ 6 , 8 , 13 , 15 , 16 , 17 , 42 ].…”
Section: Methodsmentioning
confidence: 99%
“…Multi-task learning with the goal of learning multiple tasks simultaneously [ 40 ] is nowadays based on CNNs [ 13 , 15 , 16 , 17 , 18 , 19 , 20 , 41 , 42 , 43 , 44 , 44 ]. Eigen and Fergus proposed a network structure that can estimate the depth, semantic labels, and the surface orientation of a scene [ 13 ].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Monocular Depth Estimation [22][23][24][25][26][27][28] infers depth information from single RGB images and is demonstrated to be an ill-posed problem(In most cases, there are several possible outputs corresponding to a given input image and the problem can be seen as a task of selecting the most proper one from all the possible outputs [29]). Stereo Depth Estimation [30][31][32] needs specific devices to capture stereo images and can provide much more clues to estimate the depth than Monocular Depth Estimation.…”
Section: Introductionmentioning
confidence: 99%