Audio-Visual Perception System for a Humanoid Robotic Head

One of the main issues within the field of social robotics is to endow robots with the ability to direct attention to people with whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues to localize a person using multiple sensors. However, most o...

Full description

Bibliographic Details
Main Authors: Raquel Viciana-Abad, Rebeca Marfil, Jose M. Perez-Lorenzo, Juan P. Bandera, Adrian Romero-Garces, Pedro Reche-Lopez
Format: Article
Language:English
Published: MDPI AG 2014-05-01
Series:Sensors
Subjects:
Online Access:http://www.mdpi.com/1424-8220/14/6/9522
_version_ 1811183880533704704
author Raquel Viciana-Abad
Rebeca Marfil
Jose M. Perez-Lorenzo
Juan P. Bandera
Adrian Romero-Garces
Pedro Reche-Lopez
author_facet Raquel Viciana-Abad
Rebeca Marfil
Jose M. Perez-Lorenzo
Juan P. Bandera
Adrian Romero-Garces
Pedro Reche-Lopez
author_sort Raquel Viciana-Abad
collection DOAJ
description One of the main issues within the field of social robotics is to endow robots with the ability to direct attention to people with whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues to localize a person using multiple sensors. However, most of these fusion mechanisms have been used in fixed systems, such as those used in video-conference rooms, and thus, they may incur difficulties when constrained to the sensors with which a robot can be equipped. Besides, within the scope of interactive autonomous robots, there is a lack in terms of evaluating the benefits of audio-visual attention mechanisms, compared to only audio or visual approaches, in real scenarios. Most of the tests conducted have been within controlled environments, at short distances and/or with off-line performance measurements. With the goal of demonstrating the benefit of fusing sensory information with a Bayes inference for interactive robotics, this paper presents a system for localizing a person by processing visual and audio data. Moreover, the performance of this system is evaluated and compared via considering the technical limitations of unimodal systems. The experiments show the promise of the proposed approach for the proactive detection and tracking of speakers in a human-robot interactive framework.
first_indexed 2024-04-11T13:05:18Z
format Article
id doaj.art-b8f9efca9a69454997f00f33791e44e2
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-04-11T13:05:18Z
publishDate 2014-05-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-b8f9efca9a69454997f00f33791e44e22022-12-22T04:22:49ZengMDPI AGSensors1424-82202014-05-011469522954510.3390/s140609522s140609522Audio-Visual Perception System for a Humanoid Robotic HeadRaquel Viciana-Abad0Rebeca Marfil1Jose M. Perez-Lorenzo2Juan P. Bandera3Adrian Romero-Garces4Pedro Reche-Lopez5University of Jaén, Multimedia and Multimodal Processing Group, Polytechnic School of Linares, University of Jaén Alfonso X El Sabio, 28, 23700, Linares, SpainDpto. Tecnología Electrónica, University of Málaga, Campus de Teatinos - 29071 Málaga, SpainUniversity of Jaén, Multimedia and Multimodal Processing Group, Polytechnic School of Linares, University of Jaén Alfonso X El Sabio, 28, 23700, Linares, SpainDpto. Tecnología Electrónica, University of Málaga, Campus de Teatinos - 29071 Málaga, SpainDpto. Tecnología Electrónica, University of Málaga, Campus de Teatinos - 29071 Málaga, SpainUniversity of Jaén, Multimedia and Multimodal Processing Group, Polytechnic School of Linares, University of Jaén Alfonso X El Sabio, 28, 23700, Linares, SpainOne of the main issues within the field of social robotics is to endow robots with the ability to direct attention to people with whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues to localize a person using multiple sensors. However, most of these fusion mechanisms have been used in fixed systems, such as those used in video-conference rooms, and thus, they may incur difficulties when constrained to the sensors with which a robot can be equipped. Besides, within the scope of interactive autonomous robots, there is a lack in terms of evaluating the benefits of audio-visual attention mechanisms, compared to only audio or visual approaches, in real scenarios. Most of the tests conducted have been within controlled environments, at short distances and/or with off-line performance measurements. With the goal of demonstrating the benefit of fusing sensory information with a Bayes inference for interactive robotics, this paper presents a system for localizing a person by processing visual and audio data. Moreover, the performance of this system is evaluated and compared via considering the technical limitations of unimodal systems. The experiments show the promise of the proposed approach for the proactive detection and tracking of speakers in a human-robot interactive framework.http://www.mdpi.com/1424-8220/14/6/9522multimodal perceptionbio-inspired attention mechanismhuman-robot interaction
spellingShingle Raquel Viciana-Abad
Rebeca Marfil
Jose M. Perez-Lorenzo
Juan P. Bandera
Adrian Romero-Garces
Pedro Reche-Lopez
Audio-Visual Perception System for a Humanoid Robotic Head
Sensors
multimodal perception
bio-inspired attention mechanism
human-robot interaction
title Audio-Visual Perception System for a Humanoid Robotic Head
title_full Audio-Visual Perception System for a Humanoid Robotic Head
title_fullStr Audio-Visual Perception System for a Humanoid Robotic Head
title_full_unstemmed Audio-Visual Perception System for a Humanoid Robotic Head
title_short Audio-Visual Perception System for a Humanoid Robotic Head
title_sort audio visual perception system for a humanoid robotic head
topic multimodal perception
bio-inspired attention mechanism
human-robot interaction
url http://www.mdpi.com/1424-8220/14/6/9522
work_keys_str_mv AT raquelvicianaabad audiovisualperceptionsystemforahumanoidrobotichead
AT rebecamarfil audiovisualperceptionsystemforahumanoidrobotichead
AT josemperezlorenzo audiovisualperceptionsystemforahumanoidrobotichead
AT juanpbandera audiovisualperceptionsystemforahumanoidrobotichead
AT adrianromerogarces audiovisualperceptionsystemforahumanoidrobotichead
AT pedrorechelopez audiovisualperceptionsystemforahumanoidrobotichead