Proceedings of the 2nd ACM TRECVid Video Summarization Workshop 2008
DOI: 10.1145/1463563.1463590
|View full text |Cite
|
Sign up to set email alerts
|

Video summarization from spatio-temporal features

Abstract: In this paper we present a video summarization method based on the study of spatio-temporal activity within the video. The visual activity is estimated by measuring the number of interest points, jointly obtained in the spatial and temporal domains. The proposed approach is composed of five steps. First, image features are collected using the spatio-temporal Hessian matrix. Then, these features are processed to retrieve the candidate video segments for the summary (denoted clips). Further on, two specific step… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
39
0

Year Published

2008
2008
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 55 publications
(40 citation statements)
references
References 12 publications
0
39
0
Order By: Relevance
“…Video summarization can shorten video in several ways. In this paper, we focus on the two most common ones: keyframe selection, where the system identifies a series of defining frames [1,2,3,4,5] and key subshot selection, where the system identifies a series of defining subshots, each of which is a temporally contiguous set of frames spanning a short time interval [6,7,8,9].…”
Section: Introductionmentioning
confidence: 99%
“…Video summarization can shorten video in several ways. In this paper, we focus on the two most common ones: keyframe selection, where the system identifies a series of defining frames [1,2,3,4,5] and key subshot selection, where the system identifies a series of defining subshots, each of which is a temporally contiguous set of frames spanning a short time interval [6,7,8,9].…”
Section: Introductionmentioning
confidence: 99%
“…In (Laganière et al, 2008) the activity level is defined within a video as the number of pixels altering their characteristics between two images. As a consequence, he proposes to define an activity function by the number of detected STIPs within each frame.…”
Section: Principlementioning
confidence: 99%
“…Once shots had been selected for inclusion in the generated summary, sub-shots of 2 to 3 seconds were selected and some of them were played back at an accelerated rate. A storyboard of shot keyframes was generated and included at both the start, and the [12] end, of the generated summary. A smooth zoom from the opening storyboard to the playback window (occupying 80% of the screen) took place at the start of the summary, and as the summary transitioned from shot to shot this was tracked on the storyboard.…”
Section: Participants and Their Approachesmentioning
confidence: 99%
“…Finally, the University of Ottawa -SITE group in Canada combined with the Université de Savoie in France and a group from LAPI, University of Bucharest in Romania to study the spatio-temporal Figure 2: Example of Tukey-style boxplot activity levels of input videos [12]. This was done by generating a spatio-temporal matrix of interest points, with explicit removal of junk frames.…”
Section: Participants and Their Approachesmentioning
confidence: 99%