Visual and Multimedia Information Management 2002
DOI: 10.1007/978-0-387-35592-4_8
|View full text |Cite
|
Sign up to set email alerts
|

The Segmentation and Classification of Story Boundaries in News Video

Abstract: The segmentation and classification of news video into single-story semantic units is a challenging problem. This research proposes a two-level, multi-modal framework to tackle this problein. The video is analyzed at the shot and story unit (or scene) levels using a variety of features and techniques. At the shot level, we employ a Decision Tree to classify the shot into one of 13 pre-defined categories. At the scene level, we perform the HMM (Hidden Markov Models) analysis to eliminate shot classification err… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2006
2006
2014
2014

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 12 publications
(18 citation statements)
references
References 16 publications
0
18
0
Order By: Relevance
“…In the literature, there are several works that study scene analysis in sports videos, CCTV (Closed-Circuit Television) and news programs [9], [10], [11]. In [10], authors noticed 13 categories of shots in TV-news (interview, anchor, commercial, etc) and trained a decision tree to classify shots, then a Hidden Markov Model (HMM) to correct classification errors.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In the literature, there are several works that study scene analysis in sports videos, CCTV (Closed-Circuit Television) and news programs [9], [10], [11]. In [10], authors noticed 13 categories of shots in TV-news (interview, anchor, commercial, etc) and trained a decision tree to classify shots, then a Hidden Markov Model (HMM) to correct classification errors.…”
Section: Related Workmentioning
confidence: 99%
“…In [10], authors noticed 13 categories of shots in TV-news (interview, anchor, commercial, etc) and trained a decision tree to classify shots, then a Hidden Markov Model (HMM) to correct classification errors. Features used were shot durations, motion measurements, text and face detections.…”
Section: Related Workmentioning
confidence: 99%
“…The idea is that, since there are more silent frames in speech, the LSTER measure will be much higher for speech than that for music [21].…”
Section: High Zero Crossing Rate Ratio (Hzcrr)mentioning
confidence: 99%
“…In order to alleviate the data sparseness problem, one way is to divide the framework into multi-layered tasks such as that employed in NLP research [9] that analyzes text documents at word, phrase and sentence levels. This multilayered approach was successfully adopted in [5,6,7].…”
Section: Introductionmentioning
confidence: 99%
“…It employed a hybrid approach and the HMM for the analysis at shot and story levels respectively. The system was evaluated on the TRECVID 2003 data set [3] and it achieved the best performance in the evaluations under story segmentation task [3,6]. But the disadvantages of the system are that it is computational expensive and not easy to scale up to larger and new corpuses.…”
Section: Introductionmentioning
confidence: 99%