Speech Enhancement Using MLP-Based Architecture With Convolutional Token Mixing Module and Squeeze-and-Excitation Network
The Conformer has shown impressive performance for speech enhancement by exploiting the local and global contextual information, although it requires high computational complexity and many parameters. Recently, multi-layer perceptron (MLP)-based models such as MLP-mixer and gMLP have demonstrated co...
Main Authors: | Hyungchan Song, Minseung Kim, Jong Won Shin |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2022-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9945958/ |
Similar Items
-
Improved Speech Spatial Covariance Matrix Estimation for Online Multi-Microphone Speech Enhancement
by: Minseung Kim, et al.
Published: (2022-12-01) -
On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms
by: Elena Verdú, et al.
Published: (2020-06-01) -
Supervised Attention Multi-Scale Temporal Convolutional Network for monaural speech enhancement
by: Zehua Zhang, et al.
Published: (2024-04-01) -
Multi‐stage attention network for monaural speech enhancement
by: Kunpeng Wang, et al.
Published: (2023-03-01) -
Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model
by: Lotter Thomas, et al.
Published: (2005-01-01)