2012
DOI: 10.1016/j.cviu.2011.09.010
|View full text |Cite
|
Sign up to set email alerts
|

Selective spatio-temporal interest points

Abstract: a b s t r a c tRecent progress in the field of human action recognition points towards the use of Spatio-Temporal Interest Points (STIPs) for local descriptor-based recognition strategies. In this paper, we present a novel approach for robust and selective STIP detection, by applying surround suppression combined with local and temporal constraints. This new method is significantly different from existing STIP detection techniques and improves the performance by detecting more repeatable, stable and distinctiv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
52
0
5

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 121 publications
(57 citation statements)
references
References 67 publications
(138 reference statements)
0
52
0
5
Order By: Relevance
“…Second, since representations based on LST features represent local variations, in complex scenes a large proportion of detected features often fall on the cluttered background, especially when the camera is moving, as is common with mobile robots. In this case, irrelevant features from backgrounds usually decrease the ability to represent human activities themselves [9]. Third, existing LST feature descriptors are generally not adaptive to linear perspective view variations, i.e., the size of the feature's support region does not adapt to the distance to the camera, which results in decreased feature description capability.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Second, since representations based on LST features represent local variations, in complex scenes a large proportion of detected features often fall on the cluttered background, especially when the camera is moving, as is common with mobile robots. In this case, irrelevant features from backgrounds usually decrease the ability to represent human activities themselves [9]. Third, existing LST feature descriptors are generally not adaptive to linear perspective view variations, i.e., the size of the feature's support region does not adapt to the distance to the camera, which results in decreased feature description capability.…”
Section: Introductionmentioning
confidence: 99%
“…Chakraborty et al [9] proposed the selective LST feature, where interest points are extracted from the entire image and then features are pruned using surrounding suppression and space-time constraints. Our feature detector is inherently different from these feature selection methods; features irrelevant to humans are not detected (that is, no selection is performed), which significantly reduces the number of irrelevant features and thus decreases computational requirements, especially when the camera is in motion in robotics applications.…”
Section: B Local Spatio-temporal Featuresmentioning
confidence: 99%
“…Our paper describes a representation scheme akin to space time interest point (STIP) models [1], on which several other approaches build upon [20,21]. Interestingly almost all of these methods [1,22,23] have employed a local spatio-temporal scale-space extrema detection approach.…”
Section: Related Workmentioning
confidence: 99%
“…To begin, we will start developing our theory for spatiotemporal scale selection with respect to the problem of detecting sparse spatio-temporal interest points [6,9,11,14,18,20,21,30,49,88,94,97,99,100,107,122,124,126,127], which may be regarded as a conceptually simplest problem domain because of the sparsity of spatio-temporal interest points and the close connection between this problem domain and the detection of spatial interest points for which there exists a theoretically well-founded and empirically tested framework regarding scale selection over the spatial domain [1,4,5,15,17,25,42,65,72,74,84,89,90,112]. Specifically, using a non-causal Gaussian spatio-temporal scale-space model, we will perform a theoretical analysis of the spatio-temporal scale selection properties of eight different types of spatiotemporal interest point detectors and show that seven of them: (i) the spatial Laplacian of the first-order temporal derivative, (ii) the spatial Laplacian of the second-order temporal derivative, (iii) the determinant of the spatial Hessian of the first-order temporal derivative, (iv) the determinant of the spatial Hessian of the second-order temporal derivative, (v) the determinant of the spatio-temporal Hessian matrix, (vi) the first-order temporal derivative of the determinant of the spatial Hessian matrix and (vii) the second-order temporal derivative of the determinant of the spatial Hessian matrix, do all lead to fully scale-covariant spatio-temporal scale estimates and scale-invariant feature responses under independent scaling transformations of the spatial and the temporal domains.…”
Section: Fig 4 the First-and Second-order Temporal Derivatives Of Thmentioning
confidence: 99%
“…Let us approximate the spatial smoothing operation in the continuous spatio-temporal scale-space representation according to (9) by smoothing with the discrete analogue of the Gaussian kernel over the spatial domain [56] T…”
Section: Time-causal and Time-recursive Algorithm For Spatio-temporalmentioning
confidence: 99%