Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation

Underwater acoustic signal separation is a key technique for underwater communications. The existing methods are mostly model-based, and cannot accurately characterize the practical underwater acoustic communication environment. They are only suitable for binary signal separation and cannot handle m...

Full description

Bibliographic Details
Main Authors: Jie Chen, Chang Liu, Jiawu Xie, Jie An, Nan Huang
Format: Article
Language:English
Published: MDPI AG 2022-07-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/15/5598
_version_ 1827624629828583424
author Jie Chen
Chang Liu
Jiawu Xie
Jie An
Nan Huang
author_facet Jie Chen
Chang Liu
Jiawu Xie
Jie An
Nan Huang
author_sort Jie Chen
collection DOAJ
description Underwater acoustic signal separation is a key technique for underwater communications. The existing methods are mostly model-based, and cannot accurately characterize the practical underwater acoustic communication environment. They are only suitable for binary signal separation and cannot handle multivariate signal separation. However, recurrent neural networks (RNNs) show a powerful ability to extract the features of temporal sequences. Inspired by this, in this paper, we present a data-driven approach for underwater acoustic signal separation using deep learning technology. We use a bidirectional long short-term memory (Bi-LSTM) approach to explore the features of a time–frequency (T-F) mask, and propose a T-F-mask-aware Bi-LSTM for signal separation. Taking advantage of the sparseness of the T-F image, the designed Bi-LSTM network is able to extract the discriminative features for separation, which further improves the separation performance. In particular, this method breaks through the limitations of the existing methods and not only achieves good results in multivariate separation but also effectively separates signals when they are mixed with 40 dB Gaussian noise signals. The experimental results show that this method can achieve a 97% guarantee ratio (PSR), and the average similarity coefficient of the multivariate signal separation is stable above 0.8 under high noise conditions. It should be noted that our model can only handle known signals such as test signals for calibration.
first_indexed 2024-03-09T12:12:19Z
format Article
id doaj.art-fcf95af9b2754e51ac8fa01dbb256df6
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T12:12:19Z
publishDate 2022-07-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-fcf95af9b2754e51ac8fa01dbb256df62023-11-30T22:50:47ZengMDPI AGSensors1424-82202022-07-012215559810.3390/s22155598Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal SeparationJie Chen0Chang Liu1Jiawu Xie2Jie An3Nan Huang4National Key Laboratory of Science and Technology on Communication, University of Electronic Science and Technology of China, Chengdu 610000, ChinaNational Key Laboratory of Science and Technology on Communication, University of Electronic Science and Technology of China, Chengdu 610000, ChinaNational Key Laboratory of Science and Technology on Communication, University of Electronic Science and Technology of China, Chengdu 610000, ChinaNational Key Laboratory of Science and Technology on Communication, University of Electronic Science and Technology of China, Chengdu 610000, ChinaNational Key Laboratory of Science and Technology on Communication, University of Electronic Science and Technology of China, Chengdu 610000, ChinaUnderwater acoustic signal separation is a key technique for underwater communications. The existing methods are mostly model-based, and cannot accurately characterize the practical underwater acoustic communication environment. They are only suitable for binary signal separation and cannot handle multivariate signal separation. However, recurrent neural networks (RNNs) show a powerful ability to extract the features of temporal sequences. Inspired by this, in this paper, we present a data-driven approach for underwater acoustic signal separation using deep learning technology. We use a bidirectional long short-term memory (Bi-LSTM) approach to explore the features of a time–frequency (T-F) mask, and propose a T-F-mask-aware Bi-LSTM for signal separation. Taking advantage of the sparseness of the T-F image, the designed Bi-LSTM network is able to extract the discriminative features for separation, which further improves the separation performance. In particular, this method breaks through the limitations of the existing methods and not only achieves good results in multivariate separation but also effectively separates signals when they are mixed with 40 dB Gaussian noise signals. The experimental results show that this method can achieve a 97% guarantee ratio (PSR), and the average similarity coefficient of the multivariate signal separation is stable above 0.8 under high noise conditions. It should be noted that our model can only handle known signals such as test signals for calibration.https://www.mdpi.com/1424-8220/22/15/5598blind source separationbinary maskdeep learningunderwater acoustic signal
spellingShingle Jie Chen
Chang Liu
Jiawu Xie
Jie An
Nan Huang
Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
Sensors
blind source separation
binary mask
deep learning
underwater acoustic signal
title Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
title_full Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
title_fullStr Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
title_full_unstemmed Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
title_short Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
title_sort time frequency mask aware bidirectional lstm a deep learning approach for underwater acoustic signal separation
topic blind source separation
binary mask
deep learning
underwater acoustic signal
url https://www.mdpi.com/1424-8220/22/15/5598
work_keys_str_mv AT jiechen timefrequencymaskawarebidirectionallstmadeeplearningapproachforunderwateracousticsignalseparation
AT changliu timefrequencymaskawarebidirectionallstmadeeplearningapproachforunderwateracousticsignalseparation
AT jiawuxie timefrequencymaskawarebidirectionallstmadeeplearningapproachforunderwateracousticsignalseparation
AT jiean timefrequencymaskawarebidirectionallstmadeeplearningapproachforunderwateracousticsignalseparation
AT nanhuang timefrequencymaskawarebidirectionallstmadeeplearningapproachforunderwateracousticsignalseparation