Reading to listen at the cocktail party: multi-modal speech separation

The goal of this paper is speech separation and enhancement in multi-speaker and noisy environments using a combination of different modalities. Previous works have shown good performance when conditioning on temporal or static visual evidence such as synchronised lip movements or face identity. In...

Bibliographic Details
Main Authors:	Rahimi, A, Afouras, T, Zisserman, A
Format:	Conference item
Language:	English
Published:	IEEE 2022

Similar Items

Cocktail-party listening and cognitive abilities show strong pleiotropy
by: Samuel R. Mathias, et al.
Published: (2023-03-01)

AudioStreamer--leveraging the cocktail party effect for efficient listening
by: Mullins, Atty Thomas
Published: (2006)

ASR is all you need: cross-modal distillation for lip reading
by: Afouras, T, et al.
Published: (2020)

Voicevector: multimodal enrolment vectors for speaker separation
by: Rahimi, A, et al.
Published: (2024)

Musicians Show Improved Speech Segregation in Competitive, Multi-Talker Cocktail Party Scenarios
by: Gavin M. Bidelman, et al.
Published: (2020-08-01)

Linguistic processing of task-irrelevant speech at a cocktail party
by: Paz Har-shai Yahav, et al.
Published: (2021-05-01)

Inharmonic speech reveals the role of harmonicity in the cocktail party problem
by: Sara Popham, et al.
Published: (2018-05-01)

Inharmonic speech reveals the role of harmonicity in the cocktail party problem
by: Ellis, Dan P. W., et al.
Published: (2018)

Do we parse the background into separate streams in the cocktail party?
by: Orsolya Szalárdy, et al.
Published: (2022-10-01)

Familiarity of Background Music Modulates the Cortical Tracking of Target Speech at the “Cocktail Party”
by: Jane A. Brown, et al.
Published: (2022-09-01)

Speech recognition models are strong lip-readers
by: Prajwal, KR, et al.
Published: (2024)

Individual differences in selective attention predict speech identification at a cocktail party
by: Daniel Oberfeld, et al.
Published: (2016-08-01)

Breaking down the cocktail party: Attentional modulation of cerebral audiovisual speech processing
by: Patrik Wikman, et al.
Published: (2021-01-01)

Cross-modal interactions at the audiovisual cocktail-party revealed by behavior, ERPs, and neural oscillations
by: Laura-Isabelle Klatt, et al.
Published: (2023-05-01)

My lips are concealed: audio-visual speech enhancement through obstructions
by: Afouras, T, et al.
Published: (2019)

Sub-word level lip reading with visual attention
by: Prajwal, KR, et al.
Published: (2022)

The integration of continuous audio and visual speech in a cocktail-party environment depends on attention
by: Farhin Ahmed, et al.
Published: (2023-07-01)

Effects of age on electrophysiological correlates of speech processing in a dynamic cocktail-party situation
by: Stephan Getzmann, et al.
Published: (2015-09-01)

Deep lip reading: a comparison of models and an online application
by: Afouras, T, et al.
Published: (2018)

Neural speech restoration at the cocktail party: Auditory cortex recovers masked speech of both attended and ignored speakers.
by: Christian Brodbeck, et al.
Published: (2020-10-01)

Does Modality Matter? The Effects of Reading, Listening, and Dual Modality on Comprehension
by: Beth A. Rogowsky, et al.
Published: (2016-09-01)

Schema learning for the cocktail party problem
by: Woods, Kevin Jing Poh, et al.
Published: (2018)

Il cocktail party effect nelle sale di ristorazione - The cocktail party effect in restaurant dining rooms
by: Sebastiano Andrea Boemi, et al.
Published: (2019-08-01)

Deep audio-visual speech recognition
by: Afouras, T, et al.
Published: (2018)

The Genetic contribution to solving the cocktail-party problem
by: Samuel R. Mathias, et al.
Published: (2022-09-01)

Web party effect: a cocktail party effect in the web environment
by: Sara Rigutti, et al.
Published: (2015-03-01)

Finding your mate at a cocktail party: frequency separation promotes auditory stream segregation of concurrent voices in multi-species frog choruses.
by: Vivek Nityananda, et al.
Published: (2011-01-01)

Read and attend: temporal localisation in sign language videos
by: Varol, G, et al.
Published: (2021)

Brain activity during shadowing of audiovisual cocktail party speech, contributions of auditory–motor integration and selective attention
by: Patrik Wikman, et al.
Published: (2022-11-01)

Changes in breathing while listening to read speech: the effect of reader and speech mode
by: Amélie Rochet-Capellan, et al.
Published: (2013-12-01)

A cocktail party at the London Irish Women's Centre
by: London Irish Women's Centre, LIWC
Published: (1987)

On Total Vertex Irregularity Strength of Cocktail Party Graph
by: Kristiana Wijaya, et al.
Published: (2011-07-01)

Benefits of Acoustic Beamforming for Solving the Cocktail Party Problem
by: Gerald Kidd, et al.
Published: (2015-06-01)

Watch, read and lookup: learning to spot signs from multiple supervisors
by: Momeni, L, et al.
Published: (2021)

Separate neural subsystems support goal-directed speech listening
by: Liu-Fang Zhou, et al.
Published: (2022-11-01)

EEG-based Auditory Attention Detection in Cocktail Party Environment
by: Siqi Cai, et al.
Published: (2023-01-01)

Bats enhance their call identities to solve the cocktail party problem
by: Kazuma Hase, et al.
Published: (2018-05-01)

Emotion recognition in speech using cross-modal transfer in the wild
by: Albanie, S, et al.
Published: (2018)

Disentangled Speech Embeddings Using Cross-Modal Self-Supervision
by: Nagrani, A, et al.
Published: (2020)