2020
DOI: 10.1109/tits.2019.2913883
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time Semantic Segmentation-Based Stereo Reconstruction

Abstract: In this paper, we propose a novel semantic segmentation-based stereo reconstruction method that can keep up with the accuracy of the state-of-the art approaches while running in real time. The solution follows the classic stereo pipeline, each step in the stereo workflow being enhanced by additional information from semantic segmentation. Therefore, we introduce several improvements to computation, aggregation, and optimization by adapting existing techniques to integrate additional surface information given b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 33 publications
(9 citation statements)
references
References 49 publications
0
8
0
1
Order By: Relevance
“…The task aims to find corresponding pixels in a stereo pair, and the distance between corresponding pixels is known as disparity (Hartley and Zisserman 2003). Based on the epipolar geometry, stereo matching enables stable depth perception from estimated disparity, hence it has been applied to further applications such as scene understanding (Franke and Joos 2000;Miclea and Nedevschi 2019;Zhang et al 2010), object detection Li et al 2018Li et al , 2019Fang et al 2018), visual odometry (Wang et al 2017;Zhu 2017), and SLAM (Engel et al 2015;Gomez-Ojeda et al 2019).…”
Section: Introductionmentioning
confidence: 99%
“…The task aims to find corresponding pixels in a stereo pair, and the distance between corresponding pixels is known as disparity (Hartley and Zisserman 2003). Based on the epipolar geometry, stereo matching enables stable depth perception from estimated disparity, hence it has been applied to further applications such as scene understanding (Franke and Joos 2000;Miclea and Nedevschi 2019;Zhang et al 2010), object detection Li et al 2018Li et al , 2019Fang et al 2018), visual odometry (Wang et al 2017;Zhu 2017), and SLAM (Engel et al 2015;Gomez-Ojeda et al 2019).…”
Section: Introductionmentioning
confidence: 99%
“…In this section, we use ResNet50 and Xception39 as the backbone to analyze the enhancement of the overall network by different modules in BSDNet, as shown in Table. 4. BS indicates that only the backbone is used for the experiment, "BS+BSDModule" indicates that a BSDModule is added to the backbone, where the expansion factor e of the BSDModule is 5.…”
Section: ) Accuracy Analysismentioning
confidence: 99%
“…Semantically-Guided Depth Estimation Complementing self-supervised depth estimation by the additional prediction of semantic or instance segmentation in cross-task guidance approaches has been shown to increase the performance of both prediction modalities [7], [17], [8], [26], [27], [28]. Either the segmentation masks are given as an additional input to the network [28], or they are used to predict relative poses for the single dynamic objects to counterpoise their movement between consecutive frames [7], [17].…”
Section: Related Workmentioning
confidence: 99%