2012
DOI: 10.1155/2012/732514
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Story Segmentation for TV News Video Using Multiple Modalities

Abstract: While video content is often stored in rather large files or broadcasted in continuous streams, users are often interested in retrieving only a particular passage on a topic of interest to them. It is, therefore, necessary to split video documents or streams into shorter segments corresponding to appropriate retrieval units. We propose here a method for the automatic segmentation of TV news videos into stories. A-multiple-descriptor based segmentation approach is proposed. The selected multimodal features are … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 28 publications
(16 citation statements)
references
References 18 publications
0
13
0
Order By: Relevance
“…For example, Dumont and Quénot presented a system based on multimodal features extraction [7]. The approach combines audio features as silence segments and visual descriptors like anchors or logos.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…For example, Dumont and Quénot presented a system based on multimodal features extraction [7]. The approach combines audio features as silence segments and visual descriptors like anchors or logos.…”
Section: Related Workmentioning
confidence: 99%
“…In these experiments, the approaches described in [7,9,20] were selected for comparison according to two criteria. First, we chose methods based on the same assumption to segment news programs: anchorperson shots detection as the starting point for detecting news topics.…”
Section: Experiments On Trecvid Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…A temporal structural model is used in [29] to identify the different news stories in broadcast, while [8] uses machine-learning-based techniques to classify the shots of news video into predefined categories, e.g., anchor, interview, forcast. [10] proposes a method for the automatic segmentation of TV news videos into stories, where a temporal context and machine learning methods are used to perform the story boundaries detection from multimodal features. There exists some work focusing only on speech recognition of TV programs using Deep Neural Networks (DNN), for example, [24] uses generalized discriminant analysis for acoustic feature extraction and [11] represents acoustic features by an i-vector before adopting DNN techniques.…”
Section: Related Workmentioning
confidence: 99%
“…The most traditional way to realize automatic TV program segmentation is either classification strategies [10,27] or event detection approaches [4,17], which are mostly supervised approaches. Unsupervised approaches for program segmentation were also addressed recently, where audiovisual consistency [5] and clustering-based methods [15] are considered.…”
Section: Introductionmentioning
confidence: 99%