Reading to listen at the cocktail party: multi-modal speech separation
The goal of this paper is speech separation and enhancement in multi-speaker and noisy environments using a combination of different modalities. Previous works have shown good performance when conditioning on temporal or static visual evidence such as synchronised lip movements or face identity. In...
Main Authors: | Rahimi, A, Afouras, T, Zisserman, A |
---|---|
Format: | Conference item |
Language: | English |
Published: |
IEEE
2022
|
Similar Items
-
Cocktail-party listening and cognitive abilities show strong pleiotropy
by: Samuel R. Mathias, et al.
Published: (2023-03-01) -
AudioStreamer--leveraging the cocktail party effect for efficient listening
by: Mullins, Atty Thomas
Published: (2006) -
ASR is all you need: cross-modal distillation for lip reading
by: Afouras, T, et al.
Published: (2020) -
Voicevector: multimodal enrolment vectors for speaker separation
by: Rahimi, A, et al.
Published: (2024) -
Musicians Show Improved Speech Segregation in Competitive, Multi-Talker Cocktail Party Scenarios
by: Gavin M. Bidelman, et al.
Published: (2020-08-01)