2012
DOI: 10.1186/1687-6180-2012-237

A stereoscopic video conversion scheme based on spatio-temporal analysis of MPEG videos

Abstract: In this article, an automatic stereoscopic video conversion scheme which accepts MPEG-encoded videos as input is proposed. Our scheme is depth-based, relying on spatio-temporal analysis of the decoded video data to yield depth perception cues, such as temporal motion and spatial contrast, which reflect the relative depths between the foreground and the background areas. Our scheme is shot-adaptive, demanding that shot change detection and shot classification be performed for tuning of algorithm or parameters t…

Cited by 8 publications (7 citation statements)
References 28 publications
“…In addition to smoothness in spatial domain, smoothness in temporal domain is also important to avoid flickering artifacts in depth signal processing [17]. To simultaneously consider depth smoothness in both the spatial and temporal domains, trilateral filter can be extended to develop a quadri-lateral filter [8], where each support window occupies a cube and the weight is controlled according to spatial, color, depth, and temporal information.…”
Section: B Quadri-lateral Filter [8] | mentioning
confidence: 99%
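
The weighting described in this statement can be illustrated with a minimal sketch. It assumes Gaussian kernels for all four terms and a grayscale guidance video; the function name `quadrilateral_filter`, the parameter names, and the sigma values are hypothetical and are not taken from the cited papers.

```python
import math

def quadrilateral_filter(depth, gray, radius=1, t_radius=1,
                         sigma_s=1.0, sigma_c=10.0, sigma_d=10.0, sigma_t=1.0):
    """Smooth a depth video over a cube-shaped support window.

    depth, gray: lists of frames; each frame is a list of rows of floats.
    Gaussian kernels and all sigma defaults are illustrative assumptions.
    """
    T, H, W = len(depth), len(depth[0]), len(depth[0][0])
    out = [[[0.0] * W for _ in range(H)] for _ in range(T)]
    for t in range(T):
        for y in range(H):
            for x in range(W):
                num = den = 0.0
                # The support window is a cube: neighbours in x, y and time.
                for tt in range(max(0, t - t_radius), min(T, t + t_radius + 1)):
                    for yy in range(max(0, y - radius), min(H, y + radius + 1)):
                        for xx in range(max(0, x - radius), min(W, x + radius + 1)):
                            ds2 = (yy - y) ** 2 + (xx - x) ** 2          # spatial distance
                            dc = gray[tt][yy][xx] - gray[t][y][x]        # color difference
                            dd = depth[tt][yy][xx] - depth[t][y][x]      # depth difference
                            dt = tt - t                                  # temporal distance
                            w = math.exp(-ds2 / (2 * sigma_s ** 2)
                                         - dc ** 2 / (2 * sigma_c ** 2)
                                         - dd ** 2 / (2 * sigma_d ** 2)
                                         - dt ** 2 / (2 * sigma_t ** 2))
                            num += w * depth[tt][yy][xx]
                            den += w
                # den >= 1 because the centre sample always has weight 1.
                out[t][y][x] = num / den
    return out
```

Dropping the temporal term (setting `t_radius=0`) reduces this sketch to a trilateral filter over a single frame, which matches the extension the statement describes.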
“…Taking for example, bilateral filtering is often performed after depth estimation in 2D-to-3D stereo image/video conversion [7], [14], [17] to diminish estimation noises and make depth edges consistent with those in colors. In addition, after depth compression, coding distortions around object boundaries may lead to incorrect pixel shifts in synthesized views and then perceivable annoying artifacts [24].…”
Section: Introduction | mentioning
confidence: 98%
“…The FAM is composed of facial regions detected by using face detector [21]. Except for facial cue, we linearly combine the other visual clues and then yield a non-facial attention map (NFAM) [17]. For the t-th frame, its NFAM can be expressed as follows:…”
Section: B4 Facial Cue Combination | mentioning
confidence: 99%
“…Traditional 3D video format in terms of stereo-view sequence is only suitable for glassed 3D display. To provide more realistic 3D perception (e.g., auto-stereoscopic display [1][2]), the format of video-plus-depth is preferable and gains much attention recently. Thanks to the DIBR (Depth-Image-Based Rendering) technique that is capable of combining color and depth information [1][2] for the synthesis of multiple novel-views (so that observes can see objects from multiple viewing directions).…”
Section: Introduction | mentioning
confidence: 99%
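
The DIBR idea mentioned in this statement, combining color and depth to synthesize a novel view, can be sketched for a single scanline. This is a simplified illustration under stated assumptions, not the method of any cited paper: depth is 8-bit with larger values meaning nearer, the virtual camera is displaced horizontally, and disparity is a linear function of depth. The name `render_view` and the parameter `max_disparity` are hypothetical.

```python
def render_view(color_row, depth_row, max_disparity=4):
    """Synthesize one scanline of a novel view by horizontal pixel shifting.

    Assumes 8-bit depth (255 = nearest) and a linear depth-to-disparity map;
    both are illustrative choices.
    """
    width = len(color_row)
    out = [None] * width      # None marks a disocclusion hole left unfilled
    zbuf = [-1] * width       # depth of the pixel currently stored at each target
    for x in range(width):
        d = depth_row[x]
        shift = round(max_disparity * d / 255)   # nearer pixels shift further
        tx = x - shift
        # Keep the nearest pixel when several source pixels map to one target.
        if 0 <= tx < width and d > zbuf[tx]:
            out[tx] = color_row[x]
            zbuf[tx] = d
    return out
```

An error in `depth_row` near an object boundary changes `shift` for the affected pixels, which is how depth distortions turn into misplaced pixels in the synthesized view.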