2011
DOI: 10.1109/tcsvt.2011.2138830
|View full text |Cite
|
Sign up to set email alerts
|

Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
99
0
27

Year Published

2011
2011
2022
2022

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 139 publications
(126 citation statements)
references
References 31 publications
0
99
0
27
Order By: Relevance
“…Some methods focus on scene segmentation for various types of TV programs. For example, [23] proposes a novel approach to video temporal decomposition into semantic scenes jointly exploiting lowlevel and high-level features automatically extracted from the visual and the auditory channel, where a fast scene transition graph (STG) approximation and a generalized STG-based technique are proposed for multimodal scene segmentation. [31] focuses on grouping video content into semantic segments and classifying semantic scenes into different types based on the temporal constraint of video content and visual similarity between shot activities.…”
Section: Related Workmentioning
confidence: 99%
“…Some methods focus on scene segmentation for various types of TV programs. For example, [23] proposes a novel approach to video temporal decomposition into semantic scenes jointly exploiting lowlevel and high-level features automatically extracted from the visual and the auditory channel, where a fast scene transition graph (STG) approximation and a generalized STG-based technique are proposed for multimodal scene segmentation. [31] focuses on grouping video content into semantic segments and classifying semantic scenes into different types based on the temporal constraint of video content and visual similarity between shot activities.…”
Section: Related Workmentioning
confidence: 99%
“…2), adaptive thresholds (thresholds depend on the statistics of the visual features used), B-splines fittings [30], support vector machines (SVM) [31] and K-means clustering [11]. The detection accuracy of SBD methods is improved by combining several visual features [32].…”
Section: Shot Boundary Detectionmentioning
confidence: 99%
“…Most recent techniques, e.g. [10], [12], further exploit higher-level information such as visual concept and audio event detection results in order to come to a more accurate extraction of the videos' structural semantics. Specifically, in [10] the possibility of exploiting, for the purpose of video segmentation to scenes, semantic information coming from the analysis of the visual modality, was examined.…”
Section: Retrievalmentioning
confidence: 99%