Proceedings of the British Machine Vision Conference 2008
DOI: 10.5244/c.22.99

A Spatio-Temporal Descriptor Based on 3D-Gradients

Abstract: In this work, we present a novel local descriptor for video sequences. The proposed descriptor is based on histograms of oriented 3D spatio-temporal gradients. Our contribution is four-fold. (i) To compute 3D gradients for arbitrary scales, we develop a memory-efficient algorithm based on integral videos. (ii) We propose a generic 3D orientation quantization which is based on regular polyhedrons. (iii) We perform an in-depth evaluation of all descriptor parameters and optimize them for action recognition. (iv)…
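The integral-video idea in contribution (i) can be illustrated with a small sketch. It is not the authors' implementation: the function names are hypothetical, finite differences from `np.gradient` stand in for whatever gradient operator the paper uses, and the whole integral volume is kept in memory, so the memory-efficient variant mentioned in the abstract is not reproduced. The sketch only shows how a 3D summed-volume table yields the mean gradient of any cuboid with eight lookups per component.

```python
import numpy as np

def integral_video(volume):
    """Summed-volume table with a zero border in front of each axis."""
    iv = np.zeros(tuple(s + 1 for s in volume.shape), dtype=np.float64)
    iv[1:, 1:, 1:] = volume.cumsum(axis=0).cumsum(axis=1).cumsum(axis=2)
    return iv

def box_sum(iv, t0, y0, x0, t1, y1, x1):
    """Sum of volume[t0:t1, y0:y1, x0:x1] by 3D inclusion-exclusion."""
    return (iv[t1, y1, x1]
            - iv[t0, y1, x1] - iv[t1, y0, x1] - iv[t1, y1, x0]
            + iv[t0, y0, x1] + iv[t0, y1, x0] + iv[t1, y0, x0]
            - iv[t0, y0, x0])

def gradient_integrals(video):
    """One integral video per gradient component, ordered (t, y, x).

    Assumption: `video` is indexed as video[t, y, x] and has at least
    two samples along every axis.
    """
    grads = np.gradient(video.astype(np.float64))
    return [integral_video(g) for g in grads]

def mean_gradient(integrals, t0, y0, x0, t1, y1, x1):
    """Mean (gt, gy, gx) over the cuboid [t0:t1, y0:y1, x0:x1]."""
    n = (t1 - t0) * (y1 - y0) * (x1 - x0)
    return np.array([box_sum(iv, t0, y0, x0, t1, y1, x1) / n
                     for iv in integrals])
```

Once `gradient_integrals` has been computed for a clip, the cost of `mean_gradient` does not depend on the cuboid size, which is what makes evaluation at arbitrary scales cheap.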

Cited by 1,610 publications (1,190 citation statements)
References 23 publications
“…Visual Feature: For all experiments HOG3D features [2], k-means quantized into a 1000-word codebook are used. For all techniques that require visual features, the approximated Histogram Intersection Kernel via feature extension [22] is used to provide higher quality results.…”
Section: Methods (mentioning)
confidence: 99%
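The codebook pipeline described in this statement can be sketched as follows. This is an illustration under assumptions: the function names are hypothetical, the codebook is taken as already learned (for example by k-means), and the exact histogram intersection is shown rather than the approximated, feature-extension variant that the citing authors use.

```python
import numpy as np

def bow_histogram(descriptors, codebook):
    """Assign each descriptor to its nearest codeword and count.

    descriptors: (n, d) array of local descriptors (e.g. HOG3D vectors).
    codebook:    (k, d) array of cluster centers, assumed precomputed.
    Returns an L1-normalized histogram of length k.
    """
    diff = descriptors[:, None, :] - codebook[None, :, :]
    words = (diff ** 2).sum(axis=2).argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(np.float64)
    return hist / max(hist.sum(), 1.0)  # avoid dividing by zero

def histogram_intersection(h1, h2):
    """Exact histogram intersection kernel: sum of bin-wise minima."""
    return float(np.minimum(h1, h2).sum())
```

With L1-normalized histograms the kernel value lies between 0 (no shared words) and 1 (identical histograms).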
“…HOG3D [2]) from video appearance, and then apply a standard clustering algorithm. For instance, Wang et al [3] cluster images strictly based on appearance, and Niebles et al [4] develop topic models based on video bag-of-words approaches.…”
Section: Introduction (mentioning)
confidence: 99%
“…Precisely, we use the vertex points generated in a triangular tessellation to obtain a quasi-regular distribution of the orientation bins (see Figure 5b). One alternative yielding completely regular bins is the approach of Klaser et al [34], where points in the sphere surface are projected onto a platonic solid; however, it has a limitation on the number of bins, since the platonic solid with more facets available is the icosahedron (20-sided).…”
Section: Orientation Assignment (mentioning)
confidence: 99%
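The polyhedron-based quantization mentioned in this statement can be sketched for the icosahedron (20 bins). This is a rough illustration, not the published method: the function names are hypothetical, and the thresholding and rescaling steps are assumptions about how a projected gradient might be turned into a sparse histogram. The face centers of an icosahedron coincide with the vertices of its dual, the dodecahedron, which is how the axes are built here.

```python
import itertools
import numpy as np

PHI = (1.0 + np.sqrt(5.0)) / 2.0  # golden ratio

def icosahedron_face_axes():
    """Unit vectors through the 20 face centers of an icosahedron."""
    pts = [np.array(p, dtype=np.float64)
           for p in itertools.product((-1, 1), repeat=3)]
    for a, b in itertools.product((-1, 1), repeat=2):
        pts.append(np.array([0.0, a / PHI, b * PHI]))
        pts.append(np.array([a / PHI, b * PHI, 0.0]))
        pts.append(np.array([a * PHI, 0.0, b / PHI]))
    axes = np.array(pts)
    return axes / np.linalg.norm(axes, axis=1, keepdims=True)

def quantize_gradient(g, axes):
    """Spread a 3D gradient over orientation bins (assumed scheme).

    The gradient direction is projected onto every axis, the largest
    dot product between two distinct axes is subtracted as a threshold,
    negative values are clipped, and the result is rescaled to the
    original gradient magnitude.
    """
    g = np.asarray(g, dtype=np.float64)
    mag = np.linalg.norm(g)
    if mag == 0.0:
        return np.zeros(len(axes))
    q = axes @ (g / mag)
    gram = axes @ axes.T
    t = np.max(gram - 2.0 * np.eye(len(axes)))  # ignore the diagonal
    q = np.maximum(q - t, 0.0)
    norm = np.linalg.norm(q)
    if norm == 0.0:
        return np.zeros(len(axes))
    return mag * q / norm
```

A gradient that points exactly along one axis contributes to that bin only, while a gradient between face centers is shared among the nearest few bins.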
“…Based on the successful development of video features, e.g., STIP [1], cuboids [13], and 3D HoG [22], many human activity recognition methods have been developed. Previously, [15] The left figure illustrates our implicit spatial-temporal shape model on a training video.…”
Section: Related Work (mentioning)
confidence: 99%