2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763)
DOI: 10.1109/icme.2004.1394260
|View full text |Cite
|
Sign up to set email alerts
|

Formalising stories: sequences of events and state changes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…For example, dialogue detection experiments have been performed using low-level audio and visual features with a maximum classification accuracy of 96% [1]. Alternatively, emotional stages are proposed as a means for segmenting video in [24]. Detecting monologues based on audio-visual information is discussed in [11], where a maximum recall of 0.880 is reported.…”
Section: Introductionmentioning
confidence: 99%
“…For example, dialogue detection experiments have been performed using low-level audio and visual features with a maximum classification accuracy of 96% [1]. Alternatively, emotional stages are proposed as a means for segmenting video in [24]. Detecting monologues based on audio-visual information is discussed in [11], where a maximum recall of 0.880 is reported.…”
Section: Introductionmentioning
confidence: 99%
“…The LSU segmentation is based on the investigation of visual information and its temporal variations in a video sequence. A movie can be modeled as a sequence of states and events organized in space and time by creating a state graph representing the film story [47]. As far as dialogues are concerned, a dialogue scene can be defined as a set of consecutive shots, which contain conversations of people [3,22].…”
Section: Film Syntax Basicsmentioning
confidence: 99%
“…For example, automatically extracted low-level and mid-level visual features are used to detect different types of scenes, focusing on dialogue sequences [4]. Emotional stages as a means for segmenting video are proposed in [6]. The detection of monologues based on audio-visual information is discussed in [7] where a noticeably high average decision performance is reported.…”
Section: Introductionmentioning
confidence: 99%