2023
DOI: 10.1109/access.2023.3235010
|View full text |Cite
|
Sign up to set email alerts
|

State-of-the-Art Analysis of Deep Learning-Based Monaural Speech Source Separation Techniques

Abstract: The monaural speech source separation problem is an important application in the signal processing field. But recent interaction of deep learning algorithms with signal processing achieves remarkable performance improvement for speech source separation problems. This paper explores the numerous state-of-the-art deep learning-based monaural speech source separation algorithms in the timefrequency (T-F), time, and hybrid domains. The motivation, algorithm, and framework of different deep learning models for mona… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 138 publications
0
0
0
Order By: Relevance
“…A long-term objective in the field of audio processing research is to computationally emulate the selective hearing ability of humans. Early research in audio segregation primarily focused on speech separation, which is a special case of audio separation [2], [3]. With the advancements in deep learning, several deep neural network-based approaches have been studied in the field of blind speech separation [4]- [7].…”
Section: Introductionmentioning
confidence: 99%
“…A long-term objective in the field of audio processing research is to computationally emulate the selective hearing ability of humans. Early research in audio segregation primarily focused on speech separation, which is a special case of audio separation [2], [3]. With the advancements in deep learning, several deep neural network-based approaches have been studied in the field of blind speech separation [4]- [7].…”
Section: Introductionmentioning
confidence: 99%
“…The primary objective is to differentiate and recover the individual speeches by leveraging the available perceptual data, i.e., BSS of audio files [13]. As evident from the observations, the development of a robust framework capable of effectively separating speech and music has the potential to yield substantial benefits across numerous lucrative applications [14], [15].…”
Section: Introductionmentioning
confidence: 99%
“…One prominent signal processing technique in this context is BSS. Various BSS algorithms and architectures are investigated, considering their potential to improve interference management, channel estimation, beamforming, and resource allocation in these next-generation wireless networks [4], [15], [16], [17], [18], [19], [20].…”
Section: Introductionmentioning
confidence: 99%
“…Conventional deep-learning-based speech separation technologies are reviewed in [1], [2]. [3] describes the comprehensive outlook of the state-of-the-art technologies for speech separation and presents a detailed performance comparison. However, these approaches have not yet reached the levels of separation capability of auditory systems.…”
Section: Introductionmentioning
confidence: 99%