2002
DOI: 10.1109/6046.985555
|View full text |Cite
|
Sign up to set email alerts
|

Event based indexing of broadcasted sports video by intermodal collaboration

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
122
0

Year Published

2003
2003
2012
2012

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 185 publications
(122 citation statements)
references
References 17 publications
0
122
0
Order By: Relevance
“…This type text always appeared during the whole video sequence, for example, the extracted local text lines of Fig The title text is usually provides more information on its corresponding highlight event inference than a long term text. They are purposively added during soccer video editing, and they are aimed at providing the audience some indicative information about the video content, such as a card or a goal [56], [62]. …”
Section: Title Text Detection Localization and Trackingmentioning
confidence: 99%
“…This type text always appeared during the whole video sequence, for example, the extracted local text lines of Fig The title text is usually provides more information on its corresponding highlight event inference than a long term text. They are purposively added during soccer video editing, and they are aimed at providing the audience some indicative information about the video content, such as a card or a goal [56], [62]. …”
Section: Title Text Detection Localization and Trackingmentioning
confidence: 99%
“…We thus consider the video segmentation problem as a temporal video content trajectory breakpoint detection problem. 1 More precisely, it is a video polyline due to the discrete nature of the image frames. We first consider detecting breakpoints on a temporal video content trajectory using a Multi-Observation Hidden Markov Model (MOHMM) (Figure 4 (b)).…”
Section: Temporal Segmentation Of Surveillance Videosmentioning
confidence: 99%
“…Traditionally, a four-layer hierarchical structure is adopted for video structure analysis which consists of a frame layer, a shot layer, a scene layer and a video layer [1]. At the bottom of the structure, continuous image frames taken by a single camera are grouped into shots.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…It is common to use the domain knowledge of the class of videos for processing them, such as [9] for baseball, [10] for American football, [11] for tennis, and [12,13] for cricket etc. We use the domain knowledge of the videos to build scene categories and approximate scene models.…”
Section: Visual Domain Processing Of Videosmentioning
confidence: 99%