2013
DOI: 10.1109/tpami.2012.236
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Dynamic Textures with the Hierarchical EM Algorithm for Modeling Video

Abstract: Dynamic texture (DT) is a probabilistic generative model, defined over space and time, that represents a video as the output of a linear dynamical system (LDS). The DT model has been applied to a wide variety of computer vision problems, such as motion segmentation, motion classification, and video registration. In this paper, we derive a new algorithm for clustering DT models that is based on the hierarchical EM algorithm. The proposed clustering algorithm is capable of both clustering DTs and learning novel … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
35
0
1

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 53 publications
(36 citation statements)
references
References 32 publications
0
35
0
1
Order By: Relevance
“…In addition, most methods require prior knowledge in the form of a "clean" training video containing only the background. Dynamic texture models have also shown promise in clustering the microscopic and macroscopic motion patterns present in dynamic scenes [8][9][10]. [8] performs motion segmentation by clustering video patches using a mixture of DTs.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, most methods require prior knowledge in the form of a "clean" training video containing only the background. Dynamic texture models have also shown promise in clustering the microscopic and macroscopic motion patterns present in dynamic scenes [8][9][10]. [8] performs motion segmentation by clustering video patches using a mixture of DTs.…”
Section: Introductionmentioning
confidence: 99%
“…An alternative approach, presented in [2], [3], is based on the probabilistic framework of the DT. For each video, spatiotemporal patches are extracted using dense sampling, and a dynamic texture mixture (DTM) is learned for each video using the EM algorithm [27].…”
Section: Learning the Codebookmentioning
confidence: 99%
“…T HE bag-of-systems (BoS) representation [1], a high-level descriptor of motion in a video, has seen promising results in video texture classification [2], [3], [4]. The BoS representation of videos is analogous to the bag-of-words representation of text documents, where documents are represented by counting the occurrences of each word, or the bag-of-visual-words representation of images, where images are represented by counting the occurrences of visual codewords in the image.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A second dominant approach is the global modeling of videos using Linear Dynamical Systems (LDS) (Chan and Vasconcelos, 2008;Doretto et al, 2003;Mumtaz et al, 2013;Saisan et al, 2001). In its essence, LDS is a latent variable model which projects video frames to a lower dimensional space and tracks the temporal behaviour in that lower dimensional space.…”
Section: Related Workmentioning
confidence: 99%