Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Application 2017
DOI: 10.5220/0006129200750085
|View full text |Cite
|
Sign up to set email alerts
|

Joint Semantic and Motion Segmentation for Dynamic Scenes using Deep Convolutional Networks

Abstract: Abstract:Dynamic scene understanding is a challenging problem and motion segmentation plays a crucial role in solving it. Incorporating semantics and motion enhances the overall perception of the dynamic scene. For applications of outdoor robotic navigation, joint learning methods have not been extensively used for extracting spatiotemporal features or adding different priors into the formulation. The task becomes even more challenging without stereo information being incorporated. This paper proposes an appro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 26 publications
0
3
0
Order By: Relevance
“…A higher scene level understanding can be obtained by integrating semantics of the scene to get better correlation between objects in the scene and the depth and ego-motion estimates. This is similar to using semantic motion segmentation [17,18]. Architectural changes could also be leveraged to get a stronger coupling between depth and pose by having a single network predicting both pose and depth in order to allow the network to be able to learn representations that capture the complex relation between both camera motion and scene depth.…”
Section: Discussionmentioning
confidence: 99%
“…A higher scene level understanding can be obtained by integrating semantics of the scene to get better correlation between objects in the scene and the depth and ego-motion estimates. This is similar to using semantic motion segmentation [17,18]. Architectural changes could also be leveraged to get a stronger coupling between depth and pose by having a single network predicting both pose and depth in order to allow the network to be able to learn representations that capture the complex relation between both camera motion and scene depth.…”
Section: Discussionmentioning
confidence: 99%
“…The current method however only performs pixel level inferences. A higher scene level understanding can be obtained by integrating semantics to get better correlation between objects in the scene and depth & ego-motion estimates, similar to semantic motion segmentation [19,20].…”
Section: Discussionmentioning
confidence: 99%
“…The spectral clustering methods are effective at data clustering but cannot handle outliers and noise and often require post-processing. In recent years, there are some motion segmentation algorithms based on deep learning [24,25,26,27], which usually obtain more accurate segmentation results. However, the results of the deep-learning-based method depend strongly on the semantic segmentation.…”
Section: Introductionmentioning
confidence: 99%