This paper describes an original approach which jointly addresses two fundamental issues of video partitioning which represent the early important stage of any content-based video indexing system. These two issues are the detection of shot changes, and the labeling of the shot con guration related to the camera movement in terms of static shot, panning, traveling, zooming,... They are both derived from the computation, at each time instant, of the dominant motion in the image represented by a 2D a ne model, and from the variation of the size of its associated support. The successive steps of the method rely on statistical techniques ensuring robustness and e ciency. In particular, it can cope with scenes containing moving objects. Results on a real documentary video are reported and validate the proposed approach. Unité de recherche INRIA Rennes IRISA, Campus universitaire de Beaulieu, 35042 RENNES Cedex (France) Téléphone : 02 99 84 71 00 -International : +33 2 99 84 71 00 R sum : Nous d crivons une approche traitant conjointement deux probl mes fondamentaux du partitionnement de vid o en plan, la d tection des changements de plan et l' tiquetage d'une con guration de plan reli un mouvement de cam ra en termes de plan xe, traveling, zoom... Il s'agit d' une premi re tape importante pour tout syst me d'indexation de vid o par le contenu. La m thode propos e exploite, pour les deux aspects du probl me, l'estimation du mouvement dominant dans l'image, repr sent par un mod le a ne 2D, et de l' volution temporelle du support associ . Les tapes successives de cette m thode reposent sur des techniques statistiques, qui en assurent la robustesse et l'e cacit . Mots-cl : D coupage en plan, Mouvement de cam ra, Indexation vid o
Usage of mobile devices (phones, digital cameras) raises the need for organizing large personal image collections. In accordance with studies on user needs, we propose a statistical criterion and an associated optimization technique, relying on geo-temporal image metadata, for building and tracking a hierarchical structure on the image collection. In a mixture model framework, particularities of the application and typical data sets are taken into account in the design of the scheme (incrementality, ability to cope with non-Gaussian data, with both small and large samples). Results are reported on real data sets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.