Audiovisual Tracking of Multiple Speakers in Smart Spaces

This paper presents GAVT, a highly accurate audiovisual 3D tracking system based on particle filters and a probabilistic framework, employing a single camera and a microphone array. Our first contribution is a complex visual appearance model that accurately locates the speaker’s mouth. It transforms...

Full description

Bibliographic Details
Main Authors: Frank Sanabria-Macias, Marta Marron-Romera, Javier Macias-Guarasa
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/15/6969