Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation

Bibliographic Details
Main Authors: Deokgyu Yun, Seung Ho Choi
Format: Article
Language: English
Published: MDPI AG 2022-01-01
Series: Sensors
Subjects: audio data augmentation; dereverberation; deep learning; room impulse response
Online Access: https://www.mdpi.com/1424-8220/22/2/592
author Deokgyu Yun
Seung Ho Choi
collection DOAJ
description This paper proposes a deep learning-based audio data augmentation method to improve dereverberation performance. Conventionally, audio data are augmented using a room impulse response that is generated artificially, for example by the image method. The proposed method estimates a reverberation environment model with a deep neural network trained using clean audio data as inputs and recorded audio data as outputs. A large real augmented database is then constructed using the trained reverberation model, and the dereverberation model is trained on this augmented database. The augmentation model was verified by the log spectral distance and mean square error between the real augmented data and the recorded data. In dereverberation experiments, the proposed method also outperformed the conventional method.
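As a rough illustration of the two ideas named in the description (conventional augmentation by convolving clean audio with a room impulse response, and verification by log spectral distance and mean square error), here is a minimal NumPy sketch. It is not the authors' implementation. The function names, the exponentially decaying noise used as a stand-in impulse response (deliberately not the image method), the frame and hop sizes, and this particular definition of log spectral distance are all assumptions made for illustration.

```python
import numpy as np


def synthetic_rir(rt60=0.5, sr=16000, seed=0):
    """Toy impulse response: white noise with an exponential decay.

    Assumption: a stand-in for an artificially generated RIR; this is
    NOT the image method mentioned in the record.
    """
    rng = np.random.default_rng(seed)
    n = int(rt60 * sr)
    t = np.arange(n) / sr
    # Amplitude falls by 60 dB (factor 1000) at t = rt60; ln(1000) ~= 6.9078.
    decay = np.exp(-6.9078 * t / rt60)
    rir = rng.standard_normal(n) * decay
    rir[0] = 1.0  # direct-path component
    return rir / np.max(np.abs(rir))


def augment(clean, rir):
    """Conventional augmentation: convolve clean audio with an RIR."""
    return np.convolve(clean, rir)[: len(clean)]


def log_spectral_distance(ref, est, n_fft=512, hop=256, eps=1e-10):
    """One common LSD definition: per-frame RMS difference of log-power
    spectra in dB, averaged over frames (assumed, not taken from the paper).
    Both signals must have the same length, at least n_fft samples."""
    def log_power(x):
        frames = np.lib.stride_tricks.sliding_window_view(x, n_fft)[::hop]
        spec = np.fft.rfft(frames * np.hanning(n_fft), axis=1)
        return 10.0 * np.log10(np.abs(spec) ** 2 + eps)

    diff = log_power(ref) - log_power(est)
    return float(np.mean(np.sqrt(np.mean(diff ** 2, axis=1))))


def mse(ref, est):
    """Mean square error between two equal-length signals."""
    return float(np.mean((ref - est) ** 2))


if __name__ == "__main__":
    clean = np.random.default_rng(1).standard_normal(16000)  # 1 s of noise
    reverberant = augment(clean, synthetic_rir())
    print("LSD (dB):", log_spectral_distance(clean, reverberant))
    print("MSE:", mse(clean, reverberant))
```

In the setting the description outlines, the same two metrics would be computed between the model-generated augmented data and the actually recorded data rather than between clean and reverberant signals as in this demo.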
first_indexed 2024-03-10T00:33:50Z
format Article
id doaj.art-d6e19364f3a848b7aac0c31b17414fde
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T00:33:50Z
publishDate 2022-01-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-d6e19364f3a848b7aac0c31b17414fde; 2023-11-23T15:21:14Z; eng; MDPI AG; Sensors; 1424-8220; 2022-01-01; 22; 2; 592; 10.3390/s22020592
Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
Deokgyu Yun (0), Department of Electronic Engineering, Seoul National University of Science and Technology, Seoul 139-743, Korea
Seung Ho Choi (1), Department of Electronic and IT Media Engineering, Seoul National University of Science and Technology, Seoul 139-743, Korea
https://www.mdpi.com/1424-8220/22/2/592
audio data augmentation; dereverberation; deep learning; room impulse response
title Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
topic audio data augmentation
dereverberation
deep learning
room impulse response
url https://www.mdpi.com/1424-8220/22/2/592