2013
DOI: 10.1007/978-3-642-40395-8_15
|View full text |Cite
|
Sign up to set email alerts
|

Semantic Video Segmentation from Occlusion Relations within a Convex Optimization Framework

Abstract: Abstract. We describe an approach to incorporate scene topology and semantics into pixel-level object detection and localization. Our method requires video to determine occlusion regions and thence local depth ordering, and any visual recognition scheme that provides a score at local image regions, for instance object detection probabilities. We set up a cost functional that incorporates occlusion cues induced by object boundaries, label consistency and recognition priors, and solve it using a convex optimizat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 33 publications
0
10
0
Order By: Relevance
“…They usually do not use high-level object recognizers or try to improve optical flow. Taylor et al [51] incorporate object detections and use temporal information to reason about occlusions to improve their segmentation results, but do not compute optical flow. Lalos et al [30] compute optical flow for an object of interest using a tracking-by-detection approach.…”
Section: Related Workmentioning
confidence: 99%
“…They usually do not use high-level object recognizers or try to improve optical flow. Taylor et al [51] incorporate object detections and use temporal information to reason about occlusions to improve their segmentation results, but do not compute optical flow. Lalos et al [30] compute optical flow for an object of interest using a tracking-by-detection approach.…”
Section: Related Workmentioning
confidence: 99%
“…However, they do not model multiple semantic object classes, nor do they capture contextual relations between objects and background classes. Taylor et al [21] jointly infer pixel semantic classes and occlusion relationship in video segmentation. Unlike our method, they do not incorporate object instance level reasoning.…”
Section: Related Workmentioning
confidence: 99%
“…It also provides relations between objects, in the sense that the scheme does not just attach labels to object, but also determines whether there are multiple objects, and in what depth ordering they are presented relative to the viewer. The key results have been presented in [17], where the ideas have been tested on benchmark datasets.…”
Section: Semantic Video Segmentationmentioning
confidence: 99%
“…The following references describe work that has been conducted during this project and acknowledge support by ARO: Year 1: [14,3,9,15,1,5,8,12], Year 2: [2,4,10,6,11,19], Year 3: [7,17,18].…”
Section: Publicationsmentioning
confidence: 99%
See 1 more Smart Citation