Reading to listen at the cocktail party: multi-modal speech separation

The goal of this paper is speech separation and enhancement in multi-speaker and noisy environments using a combination of different modalities. Previous works have shown good performance when conditioning on temporal or static visual evidence such as synchronised lip movements or face identity. In...

Bibliographic Details
Main Authors: Rahimi, A, Afouras, T, Zisserman, A
Format: Conference item
Language: English
Published: IEEE 2022