Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface
A method of detecting speech events in a multiple-sound-source condition using audio and video information is proposed. For detecting speech events, sound localization using a microphone array and human tracking by stereo vision is combined by a Bayesian network. From the inference results of the Ba...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2004-09-01
|
Series: | EURASIP Journal on Advances in Signal Processing |
Subjects: | |
Online Access: | http://dx.doi.org/10.1155/S1110865704402303 |