2023
DOI: 10.3390/s23156969

Audiovisual Tracking of Multiple Speakers in Smart Spaces

Frank Sanabria-Macias,
Marta Marron-Romera,
Javier Macias-Guarasa

Abstract: This paper presents GAVT, a highly accurate audiovisual 3D tracking system based on particle filters and a probabilistic framework, employing a single camera and a microphone array. Our first contribution is a complex visual appearance model that accurately locates the speaker’s mouth. It transforms a Viola & Jones face detector classifier kernel into a likelihood estimator, leveraging knowledge from multiple classifiers trained for different face poses. Additionally, we propose a mechanism to handle occlu…

citations
Cited by 0 publications
references
References 60 publications
(148 reference statements)