Learning mixed-state Markov models for statistical motion texture tracking

Crivelli, Tomas; Bouthémy, Patrick; Cernuschi-Frias, B.; Yao, Jianfeng

doi:10.1109/iccvw.2009.5457666

Cited by 5 publications

(4 citation statements)

References 14 publications


“…This type of stochastic models can be successfully applied in various contexts, from dynamic texture classification to motion segmentation [9] or tracking [8]. However, not unlike optical flow, LDS must respect constraints which are not easily satisfied in complex natural scenes.…”

Section: Related Work and Contributions (mentioning)

confidence: 99%

Perceptual Principles for Video Classification With Slow Feature Analysis

Thériault, Thome, Cord et al. 2014

IEEE J. Sel. Top. Signal Process.


At the core of vision research is the notion of perceptual invariance. The question of how the visual system is able to develop stable or invariant states through the ever transforming environment is central to understanding the brain's recognition process. The coined term slowness principle used in slow feature analysis is a reference to the brain's ability to generate slow changing and thus stable percepts in response to the fast varying visual stimulations in the environment. Based on this principle this paper deals with categorization of video sequences composed of dynamic natural scenes. Unlike models relying on supervised learning or handcrafted descriptors, we represent videos using unsupervised learning of motion features. Our method is based on: 1) Slow feature analysis principle from which motion features representing the principal and more stable motion components of training videos are learned. 2) Integration of the local motion feature into a global classification architecture. Classification experiments produce 11% and 19% improvements compared to state-of-the-art methods on two dynamic natural scenes data sets. A quantitative and qualitative analysis illustrates how the learned slow features untangle the input manifolds and remain stable under various parameters settings.


“…It is demonstrated that the normal flow scalar motion observations extracted from these video sequences show a discrete value at zero (null-motion) and a Gaussian continuous distribution for the rest of the values. This model was extended in Crivelli et al. (2006, 2009) and applied to the problems of motion texture segmentation, recognition and tracking. For these applications, the issue is different than for simultaneous decision-estimation problems.…”

Section: Related Work and Connections (mentioning)

confidence: 99%

Simultaneous Motion Detection and Background Reconstruction with a Conditional Mixed-State Markov Random Field

et al. 2011


In this work we present a new way of simultaneously solving the problems of motion detection and background image reconstruction. An accurate estimation of the background is only possible if we locate the moving objects. Meanwhile, a correct motion detection is achieved if we have a good available background model. The key of our joint approach is to define a single random process that can take two types of values, instead of defining two different processes, one symbolic (motion detection) and one numeric (background intensity estimation). It thus allows to exploit the (spatio-temporal) interaction between a decision (motion detection) and an estimation (intensity reconstruction) problem. Consequently, the meaning of solving both tasks jointly, is to obtain a single optimal estimate of such a process. The intrinsic interaction and simultaneity between both problems is shown to be better modeled within the so-called mixed-state statistical framework, which is extended here to account for symbolic states and conditional random fields. Experiments on real sequences and comparisons with existing motion detection methods support our proposal. Further implications for video sequence inpainting will be also discussed.
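
The mixed-state idea mentioned in the citation statement above (a discrete value at zero plus a Gaussian distribution for the remaining values) can be illustrated with a minimal sketch. It assumes independent observations, so it omits the spatial Markov interactions that the cited work models. The function names and the parameters `rho`, `mu` and `sigma` are hypothetical and chosen only for this illustration.

```python
import math
import random


def sample_mixed_state(rho, mu, sigma, rng):
    """Draw one value: exactly 0.0 with probability rho, else Gaussian(mu, sigma)."""
    if rng.random() < rho:
        return 0.0
    return rng.gauss(mu, sigma)


def log_likelihood(x, rho, mu, sigma):
    """Log-likelihood with respect to the mixed measure (point mass at 0 plus Lebesgue)."""
    if x == 0.0:
        return math.log(rho)
    gauss = math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))
    return math.log((1.0 - rho) * gauss)


def fit(samples):
    """Maximum-likelihood estimates for the independent case (an assumption):
    rho from the fraction of exact zeros, mu and sigma from the non-zero samples."""
    nonzero = [x for x in samples if x != 0.0]
    rho = 1.0 - len(nonzero) / len(samples)
    mu = sum(nonzero) / len(nonzero)
    var = sum((x - mu) ** 2 for x in nonzero) / len(nonzero)
    return rho, mu, math.sqrt(var)


# Synthetic data only; these are not measurements from any cited paper.
rng = random.Random(0)
data = [sample_mixed_state(0.3, 0.0, 2.0, rng) for _ in range(20000)]
rho_hat, mu_hat, sigma_hat = fit(data)
```

In this simplified setting the two kinds of value separate cleanly: the weight of the null-motion state is estimated from the exact zeros alone, and the Gaussian parameters from everything else.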


“…In order to explicitly model texture dynamics, linear dynamical systems (LDS) have been proposed in [32]. Such stochastic models have been successfully applied in various contexts, from dynamic texture classification to motion segmentation [6] or tracking [5]. However, LDS is intrinsically limited by the first-order Markov property and linearity assumption.…”

Section: Related Work and Contributions (mentioning)

confidence: 99%

“…To do this, we consider N training videos of duration T on a p × p grid as illustrated in red in figure 2. We define $v^n_{xy}(t)$ as the V1 feature for video n at spatial position (x, y) and time t. We compute all possible features $v^n_{xy}(t)$ and compute the temporal derivatives $\dot{v}^n_{xy}(t)$. The temporal covariance matrix of equation 2 is then computed by…”

Section: Learning Local Motion Features With SFA (mentioning)

confidence: 99%

Dynamic Scene Classification: Learning Motion Descriptors with Slow Features Analysis

Thériault, Thome, Cord 2013

2013 IEEE Conference on Computer Vision and Pattern Recognition


In this paper, we address the challenging problem of categorizing video sequences composed of dynamic natural scenes. Contrarily to previous methods that rely on handcrafted descriptors, we propose here to represent videos using unsupervised learning of motion features. Our method encompasses three main contributions: 1) Based on the Slow Feature Analysis principle, we introduce a learned local motion descriptor which represents the principal and more stable motion components of training videos. 2) We integrate our local motion feature into a global coding/pooling architecture in order to provide an effective signature for each video sequence. 3) We report state of the art classification performances on two challenging natural scenes data sets. In particular, an outstanding improvement of 11% in classification score is reached on a data set introduced in 2012.
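
The citation statement for this paper describes computing temporal derivatives of local features and then a temporal covariance matrix. A minimal sketch of that step for a single feature sequence might look as follows. It assumes forward differences for the derivative and an uncentered average outer product for the matrix. The excerpt does not say how the paper pools over videos and grid positions or whether it centers the data, so those details are left out, and the function names are hypothetical.

```python
def temporal_derivatives(features):
    """Forward differences v(t+1) - v(t) for a list of T feature vectors."""
    return [[b - a for a, b in zip(prev, nxt)]
            for prev, nxt in zip(features, features[1:])]


def second_moment(vectors):
    """Average outer product (1/M) * sum_t d(t) d(t)^T of M vectors of length D."""
    m, d = len(vectors), len(vectors[0])
    return [[sum(v[i] * v[j] for v in vectors) / m for j in range(d)]
            for i in range(d)]


# Toy sequence: feature 0 changes slowly, feature 1 flips sign every frame.
features = [[0.1 * t, (-1.0) ** t] for t in range(6)]
derivs = temporal_derivatives(features)
cov = second_moment(derivs)
```

On this toy input the diagonal entry for the slowly varying feature is far smaller than the one for the rapidly alternating feature. Slow feature analysis exploits this contrast when it looks for projections with minimal average squared temporal derivative.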
