Deep Learning-Based Acoustic Echo Cancellation for Surround Sound Systems

Surround sound systems that play back multi-channel audio signals through multiple loudspeakers can improve augmented reality, which has been widely used in many multimedia communication systems. It is common that a hand-free speech communication system suffers from the acoustic echo problem, and th...

Full description

Bibliographic Details
Main Authors: Guoteng Li, Chengshi Zheng, Yuxuan Ke, Xiaodong Li
Format: Article
Language:English
Published: MDPI AG 2023-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/3/1266
_version_ 1797625238730571776
author Guoteng Li
Chengshi Zheng
Yuxuan Ke
Xiaodong Li
author_facet Guoteng Li
Chengshi Zheng
Yuxuan Ke
Xiaodong Li
author_sort Guoteng Li
collection DOAJ
description Surround sound systems that play back multi-channel audio signals through multiple loudspeakers can improve augmented reality, which has been widely used in many multimedia communication systems. It is common that a hand-free speech communication system suffers from the acoustic echo problem, and the echo needs to be canceled or suppressed completely. This paper proposes a deep learning-based acoustic echo cancellation (AEC) method to recover the desired near-end speech from the microphone signals in surround sound systems. The ambisonics technique was adopted to record the surround sound for reproduction. To achieve a better generalization capability against different loudspeaker layouts, the compressed complex spectra of the first-order ambisonic signals (B-format) were sent to the neural network as the input features directly instead of using the ambisonic decoded signals (D-format). Experimental results on both simulated and real acoustic environments showed the effectiveness of the proposed algorithm in surround AEC, and outperformed other competing methods in terms of the speech quality and the amount of echo reduction.
first_indexed 2024-03-11T09:53:44Z
format Article
id doaj.art-21c9b2b46da544d4b77a3a28db5299eb
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T09:53:44Z
publishDate 2023-01-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-21c9b2b46da544d4b77a3a28db5299eb2023-11-16T16:02:58ZengMDPI AGApplied Sciences2076-34172023-01-01133126610.3390/app13031266Deep Learning-Based Acoustic Echo Cancellation for Surround Sound SystemsGuoteng Li0Chengshi Zheng1Yuxuan Ke2Xiaodong Li3Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, ChinaKey Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, ChinaKey Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, ChinaKey Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, ChinaSurround sound systems that play back multi-channel audio signals through multiple loudspeakers can improve augmented reality, which has been widely used in many multimedia communication systems. It is common that a hand-free speech communication system suffers from the acoustic echo problem, and the echo needs to be canceled or suppressed completely. This paper proposes a deep learning-based acoustic echo cancellation (AEC) method to recover the desired near-end speech from the microphone signals in surround sound systems. The ambisonics technique was adopted to record the surround sound for reproduction. To achieve a better generalization capability against different loudspeaker layouts, the compressed complex spectra of the first-order ambisonic signals (B-format) were sent to the neural network as the input features directly instead of using the ambisonic decoded signals (D-format). Experimental results on both simulated and real acoustic environments showed the effectiveness of the proposed algorithm in surround AEC, and outperformed other competing methods in terms of the speech quality and the amount of echo reduction.https://www.mdpi.com/2076-3417/13/3/1266acoustic echo cancellationsurround soundambisonics
spellingShingle Guoteng Li
Chengshi Zheng
Yuxuan Ke
Xiaodong Li
Deep Learning-Based Acoustic Echo Cancellation for Surround Sound Systems
Applied Sciences
acoustic echo cancellation
surround sound
ambisonics
title Deep Learning-Based Acoustic Echo Cancellation for Surround Sound Systems
title_full Deep Learning-Based Acoustic Echo Cancellation for Surround Sound Systems
title_fullStr Deep Learning-Based Acoustic Echo Cancellation for Surround Sound Systems
title_full_unstemmed Deep Learning-Based Acoustic Echo Cancellation for Surround Sound Systems
title_short Deep Learning-Based Acoustic Echo Cancellation for Surround Sound Systems
title_sort deep learning based acoustic echo cancellation for surround sound systems
topic acoustic echo cancellation
surround sound
ambisonics
url https://www.mdpi.com/2076-3417/13/3/1266
work_keys_str_mv AT guotengli deeplearningbasedacousticechocancellationforsurroundsoundsystems
AT chengshizheng deeplearningbasedacousticechocancellationforsurroundsoundsystems
AT yuxuanke deeplearningbasedacousticechocancellationforsurroundsoundsystems
AT xiaodongli deeplearningbasedacousticechocancellationforsurroundsoundsystems