2022
DOI: 10.3390/s22155598
|View full text |Cite
|
Sign up to set email alerts
|

Time–Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation

Abstract: Underwater acoustic signal separation is a key technique for underwater communications. The existing methods are mostly model-based, and cannot accurately characterize the practical underwater acoustic communication environment. They are only suitable for binary signal separation and cannot handle multivariate signal separation. However, recurrent neural networks (RNNs) show a powerful ability to extract the features of temporal sequences. Inspired by this, in this paper, we present a data-driven approach for … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 45 publications
0
5
0
Order By: Relevance
“…Also, the accuracy of the proposed method for underwater acoustic target recognition compared with other introduced methods in Table 6. Wavelet Transform (Average Power Spectral Density) [10] 92.64% CNN+k-nearest neighbor Wavelet Packets [11] 90.30% Deep Convolutional Neural Network Synthetic Aperture Sonar Imagery [13] 90.89% Support Vector Machine (SVM) Competitive Deep-Belief Networks [14] 93.04% DCGAN + S-ResNet Spectrum Image [15] 83.15% Multi-Scale Residual Unit (MSRU) Spectrogram [16] 90.91% Separable Convolutional Neural Network Waveform [17] 95.22% Convolutional Neural Network Low-Frequency Analysis Recording (LOFAR) [18] 98.52% Support Vector Machine (SVM) micro-Doppler sonar [19] 94.31% ResNet-18 Fusion Features [20] 96.32% ResNet Multi-Window Spectral Analysis (MWSA) [21] 97.69% ResNet and DensNet Spectrogram [22] 94.00% Convolutional Neural Network DEMON and LOFAR [23] 97.00% Bidirectional Short-Term Memory (Bi-LSTM) Time-Frequency Diagrams [24] 96.90% Convolutional Neural Network Acoustic Spectrograms [25] 98 As shown in the table above, compared to the existing methods for performing UATR, the proposed models have high classification accuracy, which can increase the processing speed of target recognition and avoid wasting time in model training calculation operations. In addition, in this research, due to the use of the average integration method at the end of the layers of the convolutional algorithms of the proposed model, instead of the fully connected layer, it has been tried to reduce the complexity and increase calculations.…”
Section: B Experimental Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Also, the accuracy of the proposed method for underwater acoustic target recognition compared with other introduced methods in Table 6. Wavelet Transform (Average Power Spectral Density) [10] 92.64% CNN+k-nearest neighbor Wavelet Packets [11] 90.30% Deep Convolutional Neural Network Synthetic Aperture Sonar Imagery [13] 90.89% Support Vector Machine (SVM) Competitive Deep-Belief Networks [14] 93.04% DCGAN + S-ResNet Spectrum Image [15] 83.15% Multi-Scale Residual Unit (MSRU) Spectrogram [16] 90.91% Separable Convolutional Neural Network Waveform [17] 95.22% Convolutional Neural Network Low-Frequency Analysis Recording (LOFAR) [18] 98.52% Support Vector Machine (SVM) micro-Doppler sonar [19] 94.31% ResNet-18 Fusion Features [20] 96.32% ResNet Multi-Window Spectral Analysis (MWSA) [21] 97.69% ResNet and DensNet Spectrogram [22] 94.00% Convolutional Neural Network DEMON and LOFAR [23] 97.00% Bidirectional Short-Term Memory (Bi-LSTM) Time-Frequency Diagrams [24] 96.90% Convolutional Neural Network Acoustic Spectrograms [25] 98 As shown in the table above, compared to the existing methods for performing UATR, the proposed models have high classification accuracy, which can increase the processing speed of target recognition and avoid wasting time in model training calculation operations. In addition, in this research, due to the use of the average integration method at the end of the layers of the convolutional algorithms of the proposed model, instead of the fully connected layer, it has been tried to reduce the complexity and increase calculations.…”
Section: B Experimental Resultsmentioning
confidence: 99%
“…Song et al [23] by combining the lowfrequency analysis recording (LOFAR) and Envelope modulation on noise (DEMON) and CNN network have been able to achieve 94.00% recognition accuracy. Chen et al [24] proposed a method based on a bi-directional short-term memory (Bi-LSTM) to discover the features of a time-frequency mask to extract distinctive features of the underwater audio signal. Sheng and Zhu [25] proposed an underwater acoustic target detection method based on a UATR transformer to detect two classes of targets, which can capture global and local information on spectrograms, thereby improving the performance of UATR.…”
Section: Related Workmentioning
confidence: 99%
“…(Song, et al, 2022) by combining the low-frequency analysis recording (LOFAR) and Envelope modulation on noise (DEMON) and CNN network have been able to achieve 94.00% recognition accuracy. (Chen, et al, 2022) proposed a method based on a bi-directional short-term memory (Bi-LSTM) to discover the features of a time-frequency mask to extract distinctive features of the underwater audio signal. (Sheng & Zhu, 2023) proposed an underwater acoustic target detection method based on a UATR transformer to detect two classes of targets, which can capture global and local information on spectrograms, thereby improving the performance of UATR.…”
Section: Related Workmentioning
confidence: 99%
“…With the development and rise of artificial intelligence in recent years, algorithms combining artificial intelligence with antireverberation technology continue to surge, such as support vector machines, CNN (Song et al, 2019), RNN (Chen et al, 2022), and GAN In the beginning, it was simply a simple addition to machine learning. For example, Zhu et al designed a feature kernel function SVM based on the non-Gaussian difference between reverberation and target echo to detect the signal in the reverberation background.…”
Section: Application Of Artificial Intelligence In Reverberation Supp...mentioning
confidence: 99%