Recent research in image and video recognition indicates that many visual
processes can be thought of as being generated by a time-varying generative
model. A nearby descriptive model for visual processes is thus a statistical
distribution that varies over time. Specifically, modeling visual processes as
streams of histograms generated by a kernelized linear dynamic system turns out
to be efficient. We refer to such a model as a System of Bags. In this work, we
investigate Systems of Bags with special emphasis on dynamic scenes and dynamic
textures. Parameters of linear dynamic systems suffer from ambiguities. In
order to cope with these ambiguities in the kernelized setting, we develop a
kernelized version of the alignment distance. For its computation, we use a
Jacobi-type method and prove its convergence to a set of critical points. We
employ it as a dissimilarity measure on Systems of Bags. As such, it
outperforms other known dissimilarity measures for kernelized linear dynamic
systems, in particular the Martin Distance and the Maximum Singular Value
Distance, in every tested classification setting. A considerable margin can be
observed in settings, where classification is performed with respect to an
abstract mean of video sets. For this scenario, the presented approach can
outperform state-of-the-art techniques, such as Dynamic Fractal Spectrum or
Orthogonal Tensor Dictionary Learning
Distance-based dynamic texture recognition is an important research field in multimedia processing with applications ranging from retrieval to segmentation of video data. Based on the conjecture that the most distinctive characteristic of a dynamic texture is the appearance of its individual frames, this work proposes to describe dynamic textures as kernelized spaces of frame-wise feature vectors computed using the Scattering transform. By combining these spaces with a basis-invariant metric, we get a framework that produces competitive results for nearest neighbor classification and state-of-the-art results for nearest class center classification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.