2011 18th IEEE International Conference on Image Processing 2011
DOI: 10.1109/icip.2011.6116707
|View full text |Cite
|
Sign up to set email alerts
|

Visual pertinent 2D-to-3D video conversion by multi-cue fusion

Abstract: We describe an approach to2D-to-3D video conversion for the stereoscopic display. Targeting the problem of synthesizing the frames of a virtual "right view" from the original monocular 2D video, we generate the stereoscopic video in steps as following. (1) A 2.5D depth map is first estimated in a multi-cue fusion manner by leveraging motion cues and photometric cues in video frames with a depth prior of spatial and temporal smoothness. (2)The depth map is converted to a disparity map with considering both the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2012
2012
2016
2016

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 19 publications
(16 citation statements)
references
References 21 publications
0
16
0
Order By: Relevance
“…Fig. 5(a) displays another scene, whose complex background depth model is estimated by an automatic algorithm [34] and the foreground object is represented with a simple planar surface model. These results show the flexibility of the proposed CVCVC system in estimating scene depth.…”
Section: B Results and Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…Fig. 5(a) displays another scene, whose complex background depth model is estimated by an automatic algorithm [34] and the foreground object is represented with a simple planar surface model. These results show the flexibility of the proposed CVCVC system in estimating scene depth.…”
Section: B Results and Comparisonmentioning
confidence: 99%
“…The TransFantasy system automatically converts 2D videos into 3D. In [34], motion and aerial perspective cues are leveraged to estimate depth maps. In [33], disparity maps, but not depth maps, are directly estimated by a trained SVM using features of object motion and context motion.…”
Section: B 2d-to-stereo Video Conversion Methods and Systemsmentioning
confidence: 99%
“…At the decoder side, a 2 rom bit-stream, and then the depth cues ar ding to image features of 2D video. At last, the n a DIBR method [9].…”
Section: A Foreground Depth Mapmentioning
confidence: 99%
“…I lossless 2D shape coding alg to select a minimum numbe from a dense object contou shape can still be recovere adopt the Intelligent Scissors to skip redundant contour p features as shape recovery cu the selected control points ar the object contour can be f using image features as algorithm is described in Alg lattice is represented by a g weight on a graph edge is def 3: An example of background modeling em, near objects are labeled in detail, but for matic depth estimation (may not be so accurate) ithm proposed in [9] to automatically estimate t ws:…”
Section: A Coding Foreground Objmentioning
confidence: 99%
See 1 more Smart Citation