2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6638334
|View full text |Cite
|
Sign up to set email alerts
|

Audio constrained particle filter based visual tracking

Abstract: We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environment. A challenging problem with visual tracking is to deal with occlusions (caused by the limited field of view of cameras or by other speakers). Another challenge is associated with the particle filtering (PF) algorithm, commonly used for visual tracking, which requires a large number of particles to ensure the distribution is well modelled. In this paper, we propose a new method of fusing audio into the PF base… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
17
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 11 publications
(17 citation statements)
references
References 12 publications
0
17
0
Order By: Relevance
“…The DOA data is introduced to the SMC-PHD filter based on [34] and [36] where the efficiency of the particles is improved under a particle filter framework by re-allocating all the particles around the DOA line which is drawn from the center of the microphone array to a point in the image frame estimated by the projection of DOA to 2D image plane. However, different from [34] and [36] in which the DOA is used in the same way for all the particles, here the contribution of the DOA information is varied depending on the type of the particles.…”
Section: Audio-visual Tracker With Smc-phd Filtermentioning
confidence: 99%
See 3 more Smart Citations
“…The DOA data is introduced to the SMC-PHD filter based on [34] and [36] where the efficiency of the particles is improved under a particle filter framework by re-allocating all the particles around the DOA line which is drawn from the center of the microphone array to a point in the image frame estimated by the projection of DOA to 2D image plane. However, different from [34] and [36] in which the DOA is used in the same way for all the particles, here the contribution of the DOA information is varied depending on the type of the particles.…”
Section: Audio-visual Tracker With Smc-phd Filtermentioning
confidence: 99%
“…However, different from [34] and [36] in which the DOA is used in the same way for all the particles, here the contribution of the DOA information is varied depending on the type of the particles. Similar to [34] and [36], we also use the samspare-mean (SSM) method [48] for the DOA estimation which is further enhanced by a third-order Auto-Regressive (AR) model. We should note that there are other audio features and algorithms for extracting these features that could be used in our proposed system, however, exploring other audio detection methods is beyond the scope of this work.…”
Section: Audio-visual Tracker With Smc-phd Filtermentioning
confidence: 99%
See 2 more Smart Citations
“…If the calibration information is not available, the positions of the microphone arrays could be estimated via microphone self-calibration [50] or combined microphone and camera calibrations [51], which is however beyond the scope of this study. The idea behind our approach is to relocate the distributed particles around the DOA line and then re-calculate the weights of the relocated particles according to their distance to the DOA line [52]. The DOA line can be drawn as follows.…”
Section: Proposed Particle Filter Based Audio Constraint Visual mentioning
confidence: 99%