Automatic Story Segmentation for TV News Video Using Multiple Modalities

Dumont, Emilie; Quénot, Georges

doi:10.1155/2012/732514

Cited by 28 publications

(16 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, Dumont and Quénot presented a system based on multimodal features extraction [7]. The approach combines audio features as silence segments and visual descriptors like anchors or logos.…”

Section: Related Workmentioning

confidence: 99%

“…In these experiments, the approaches described in [7,9,20] were selected for comparison according to two criteria. First, we chose methods based on the same assumption to segment news programs: anchorperson shots detection as the starting point for detecting news topics.…”

Section: Experiments On Trecvid Datasetmentioning

confidence: 99%

“…The second criterion is that all approaches were experimented on the same standard TRECVID 2003 dataset. To reach an accurate comparison with other works, we used same metrics as defined in [7]. So, we evaluated the news segmentation performance of stand-alone and rewire modes using the precision, recall and F1 metrics.…”

Section: Experiments On Trecvid Datasetmentioning

confidence: 99%

See 2 more Smart Citations

Automatic topics segmentation for TV news video using prior knowledge

Zlitni

Bouaziz

Mahdi

2015

Multimed Tools Appl

View full text Add to dashboard Cite

TV streams represent a principal source of multimedia information. The goal of the proposed approach is to enable a better exploitation of this source of video by multimedia services (i.e., TV-On-Demand, catch-up TV), social community, and video-sharing platforms (Vimeo, Youtube, Facebook …). In this work, we present an automatic structuring approach of TV news. The originality of the approach is the use of the contextual and operational characteristics as prior knowledge. This knowledge is modeled as video grammar which governs the structuring of TV stream content. This structuring is carried out on two levels. The first level identifies news programs in TV stream. The second level aims to identify the internal structure of the identified news programs. At this level, we opt to treat the case of TV news programs due to the large audience because of pertinent information within. Comparison experiments to similar works have been carried out on the TRECVID 2003 database. We show significant improvements to TV news structuring exceed 90 %.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Experiments On Trecvid Datasetmentioning

confidence: 99%

See 1 more Smart Citation

Automatic topics segmentation for TV news video using prior knowledge

Zlitni

Bouaziz

Mahdi

2015

Multimed Tools Appl

View full text Add to dashboard Cite

show abstract

“…A temporal structural model is used in [29] to identify the different news stories in broadcast, while [8] uses machine-learning-based techniques to classify the shots of news video into predefined categories, e.g., anchor, interview, forcast. [10] proposes a method for the automatic segmentation of TV news videos into stories, where a temporal context and machine learning methods are used to perform the story boundaries detection from multimodal features. There exists some work focusing only on speech recognition of TV programs using Deep Neural Networks (DNN), for example, [24] uses generalized discriminant analysis for acoustic feature extraction and [11] represents acoustic features by an i-vector before adopting DNN techniques.…”

Section: Related Workmentioning

confidence: 99%

“…The most traditional way to realize automatic TV program segmentation is either classification strategies [10,27] or event detection approaches [4,17], which are mostly supervised approaches. Unsupervised approaches for program segmentation were also addressed recently, where audiovisual consistency [5] and clustering-based methods [15] are considered.…”

Section: Introductionmentioning

confidence: 99%

Content-based unsupervised segmentation of recurrent TV programs using grammatical inference

Vallet

Carrive

et al. 2017

Multimed Tools Appl

View full text Add to dashboard Cite

TV program segmentation raised as a major topic in the last decade for the task of high quality indexing of multimedia content. Earlier studies of TV program segmentation are either highly supervised (e.g., event detection) or too specific to a certain type of program (e.g., cluster-based methods), which is not practically usable for indexing tasks because of the lack of generality of programs types. In this paper, we address the problem of unsupervised TV program segmentation by leveraging grammatical inference, i.e., discovering a common structural model shared by a collection of episodes of a recurrent TV program by finding an optimal alignment of structural elements across episodes. Structural elements referring to a video segment with a particular syntactic meaning with respect to the video structure. The use of symbolic representation of structural elements makes grammatical inference feasible to be applied on TV program modeling, and makes TV program segmentation possible to rely on only minimal domain knowledge. The proposed approach is operated in two phases. The first phase aims at obtaining a symbolic representation of each episode, where the elements relevant to the structure are discovered based on recurrence mining. The second phase is that of grammatical inference from the symbolic representation of episodes. We investigate two inference techniques, one based on multiple sequence alignment and one relying on uniform resampling, to infer structural grammars for TV programs. A model of the structure is derived from the structural grammars and used to predict the structure of new episodes. Comparative evaluation on two gramBingqing Qu

show abstract