2009
DOI: 10.1016/j.patrec.2008.04.001
|View full text |Cite
|
Sign up to set email alerts
|

Video object matching across multiple independent views using local descriptors and adaptive learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
25
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 43 publications
(26 citation statements)
references
References 19 publications
1
25
0
Order By: Relevance
“…Consequently, their identities can be switched. Moreover, prolonged occlusion might occur, which might lead to track loss or mistaken identities (Teixeira and Corte-Real 2009). Since the cameras are supposed to track all objects in their coverage area, the definition of a global identity for each object is necessary.…”
Section: Relevance and Problem Definitionmentioning
confidence: 99%
“…Consequently, their identities can be switched. Moreover, prolonged occlusion might occur, which might lead to track loss or mistaken identities (Teixeira and Corte-Real 2009). Since the cameras are supposed to track all objects in their coverage area, the definition of a global identity for each object is necessary.…”
Section: Relevance and Problem Definitionmentioning
confidence: 99%
“…As an alternative, a classifier can be trained to learn the changes between cameras using labeled features. Support Vector Machines (SVM) can be employed with DCT features (Bauml et al, 2010) and SIFT (Teixeira and Corte-Real, 2009). An improvement is the Ensemble SVM, which reduces the computational cost of rankSVM for high-dimensional feature spaces besides converting the re-identification problem into a ranking problem (Prosser et al, 2010).…”
Section: Associationmentioning
confidence: 99%
“…Finally, interest points can be used for re-identification in case of variations in scale, pose and illumination (Bauml and Stiefelhagen, 2011). Examples are SIFT (Teixeira and Corte-Real, 2009), SURF-like features (Hamdoun et al, 2008;Oliveira and Luiz, 2009) and the Hessian Affine invariant operator (Gheissari et al, 2006). When intra-camera tracking information is available, features extracted from single images can be grouped over time either by temporal accumulation (Hamdoun et al, 2008) or by clustering (Farenzena et al, 2010).…”
mentioning
confidence: 99%
“…In [4], each object is represented as a "bag-of-visterms" where the visual words are local features. A model is created for each individual detected in the site.…”
Section: Related Workmentioning
confidence: 99%
“…When the cameras have overlapping fields of view, information about the geometrical relations among the camera views can be estimated and used to establish correspondences [1,2]. In the case of disjoint views, other information about the moving objects must be used to automatically identify multiple instances of the same object [3,4].…”
Section: Introductionmentioning
confidence: 99%