Masked Spectrogram Prediction for Unsupervised Domain Adaptation in Speech Enhancement
Supervised learning-based speech enhancement methods often work remarkably well in acoustic situations represented in the training corpus but generalize poorly to out-of-domain situations, i.e. situations not seen during training. This stands in the way of further improvement of these methods in rea...
Main Authors: | Katerina Zmolikova, Michael Syskind Pedersen, Jesper Jensen |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2024-01-01
|
Series: | IEEE Open Journal of Signal Processing |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10360251/ |
Similar Items
-
Transferable adversarial masked self-distillation for unsupervised domain adaptation
by: Yuelong Xia, et al.
Published: (2023-05-01) -
Unsupervised Adaptation of Deep Speech Activity Detection Models to Unseen Domains
by: Pablo Gimeno, et al.
Published: (2022-02-01) -
Mel-MViTv2: Enhanced Speech Emotion Recognition With Mel Spectrogram and Improved Multiscale Vision Transformers
by: Kah Liang Ong, et al.
Published: (2023-01-01) -
Domain‐invariant adversarial learning with conditional distribution alignment for unsupervised domain adaptation
by: Xingmei Wang, et al.
Published: (2020-12-01) -
Speech Emotion Recognition Using a Dual-Channel Complementary Spectrogram and the CNN-SSAE Neutral Network
by: Juan Li, et al.
Published: (2022-09-01)