2015
DOI: 10.1109/tmm.2014.2377515
|View full text |Cite
|
Sign up to set email alerts
|

Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering

Abstract: Abstract-The problem of tracking multiple moving speakers in indoor environments has recently received much attention. Earlier techniques were based purely on vision, but the theoretical and algorithmic advances and a constant growth in speed of processing have led to the emergence of techniques which allow the fusion of audio and visual data. The fusion of multi-modal information has been shown to be instrumental in improving tracking performance, as well as robustness in the case of challenging situations li… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
20
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
3
3
1

Relationship

3
4

Authors

Journals

citations
Cited by 59 publications
(20 citation statements)
references
References 49 publications
0
20
0
Order By: Relevance
“…For the evolution of the time dependent speaker state, the constant velocity model is employed [36], [45] given as,…”
Section: Multi-speaker Tracking With the Phd Filtermentioning
confidence: 99%
See 3 more Smart Citations
“…For the evolution of the time dependent speaker state, the constant velocity model is employed [36], [45] given as,…”
Section: Multi-speaker Tracking With the Phd Filtermentioning
confidence: 99%
“…The DOA data is introduced to the SMC-PHD filter based on [34] and [36] where the efficiency of the particles is improved under a particle filter framework by re-allocating all the particles around the DOA line which is drawn from the center of the microphone array to a point in the image frame estimated by the projection of DOA to 2D image plane. However, different from [34] and [36] in which the DOA is used in the same way for all the particles, here the contribution of the DOA information is varied depending on the type of the particles.…”
Section: Audio-visual Tracker With Smc-phd Filtermentioning
confidence: 99%
See 2 more Smart Citations
“…There is a consensus that different modalities are complementary to each other, which has motivated an increasing interest in cross-modal tracking in the last decade. Most of these works are done in the audio-visual domain [24,8,9]. Combination of other modalities has recently started to become more popular.…”
Section: Introductionmentioning
confidence: 99%