Speech separation based on reliable binaural cues with two-stage neural network in noisy-reverberant environments

Li, Ruwei; Li, Tao; Sun, Xiaoyue; Sun, Xingwu; Zhao, Fengnian

doi:10.1016/j.apacoust.2020.107445

Cited by 4 publications

(3 citation statements)

References 44 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Marginalizing over all sources and all delays, the log likelihood function for DP is given as in (16).…”

Section: ( ) ( ) ( )mentioning

confidence: 99%

“…With the development of Neural Networks (NNs), there has been tremendous improvement in a variety of speech recognition and acoustic signal processing tasks [16]. The binaural dereverberation models in [8], [16] and [17] uses artificial neural network (ANN) for binaural dereverberation preprocessing, the model in [18] uses the recurrent neural network (RNN) and interaural cues for speech enhancement in reverberant noisy conditions, while the models in [19] and [20] use the U-Net (a deep convolutional neural network (CNN)) for dereverberation, but these are monaural models.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source

Gul

Khan

Shah

2023

Computer Speech & Language

View full text Add to dashboard Cite

“…Marginalizing over all sources and all delays, the log likelihood function for DP is given as in (16).…”

Section: ( ) ( ) ( )mentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source

Gul

Khan

Shah

2023

Computer Speech & Language

View full text Add to dashboard Cite

“…Due to the interference of external noise and the instability of speech signals, traditional SR methods are difficult to achieve high accuracy and robustness, and it is also difficult to achieve the expected performance in practical applications 9 . In recent years, due to the significant improvement in computer computing performance and the rise of DL, research on SR has also made tremendous progress 10 .…”

mentioning

confidence: 99%

Application practice of neural network algorithms in speech recognition technology

Guo

2024

Second International Conference on Physics, Photonics, and Optical Engineering (ICPPOE 2023)

View full text Add to dashboard Cite

Speech recognition (SR) technology, as one of the core technologies of human-computer interaction, aims to enable computers to understand the process of converting speech signals into corresponding text or commands through natural language. With the exponential increase of internet information, the features of massive speech data have significant non-specific differences and noise interference. Common feature extraction and transformation methods are no longer sufficient to meet the current needs of model training and recognition. With the rapid growth of machine learning (ML), many researchers use neural networks (NN) to solve various problems in the SR field. This article designs a deep learning (DL) algorithm based on convolutional neural networks (CNN) and recurrent neural networks (RNN) for SR. Firstly, sample filtering, pre weighting, signal framing, and endpoint detection are performed on the speech signal. Secondly, the MFCC value of the preprocessed data is extracted. Finally, an NN model is trained and constructed, and the trained qualified model is used to complete the recognition of speech features. The experimental results show that the algorithm designed in this paper has a lower error rate for SR and stronger generalization ability, which is of great significance for the study of SR.

show abstract

Artificial Intelligence Tools for Wind Turbine Blade Monitoring

Lam,

Simani

2024

Lecture Notes in Networks and Systems

View full text Add to dashboard Cite

Speech separation based on reliable binaural cues with two-stage neural network in noisy-reverberant environments

Cited by 4 publications

References 44 publications

Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source

Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source

Application practice of neural network algorithms in speech recognition technology

Artificial Intelligence Tools for Wind Turbine Blade Monitoring

Contact Info

Product

Resources

About