2023
DOI: 10.1109/access.2023.3313972
|View full text |Cite
|
Sign up to set email alerts
|

A Robust Hybrid Neural Network Architecture for Blind Source Separation of Speech Signals Exploiting Deep Learning

Sam Ansari,
Khawla A. Alnajjar,
Tarek Khater
et al.

Abstract: In the contemporary era, blind source separation has emerged as a highly appealing and significant research topic within the field of signal processing. The imperative for the integration of blind source separation techniques within the context of beyond fifth-generation and sixth-generation networks arises from the increasing demand for reliable and efficient communication systems that can effectively handle the challenges posed by high-density networks, dynamic interference environments, and the coexistence … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 89 publications
0
5
0
Order By: Relevance
“…In our investigative analysis, we seamlessly integrated the HPSS (Harmonic-Percussive Source Separation) algorithm [29] for input Log-Mel spectrogram processing. This method enabled us to efficiently divide the spectrograms into two fundamental constituents: harmonic spectrograms and percussive spectrograms.…”
Section: Harmonic-percussive Source Separationmentioning
confidence: 99%
“…In our investigative analysis, we seamlessly integrated the HPSS (Harmonic-Percussive Source Separation) algorithm [29] for input Log-Mel spectrogram processing. This method enabled us to efficiently divide the spectrograms into two fundamental constituents: harmonic spectrograms and percussive spectrograms.…”
Section: Harmonic-percussive Source Separationmentioning
confidence: 99%
“…N t , x ( ) M t represents the source signal and observation signal, respectively, n ( ) M t represents the interference signal in the transmission channel, ŝ ( ) N t represents the estimated source signal, N is the number of source signals, and M is the number of mixing channels. Variations in mixing modes, the number of mixing channels, and the relationships between the number of sources lead to different blind source systems, each requiring distinct research approaches [12]. Below, we will introduce them separately:…”
Section: S ( )mentioning
confidence: 99%
“…To analyze the impact of different TFA methods on the accuracy of mixed matrix estimation, this paper employs the Normalized Mean Square Error (NMSE) to evaluate the estimation accuracy of the mixed matrix. The mathematical expression for NMSE is provided as follows [12]:…”
Section: ) Mixed Matrixmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, the Transformer is widely used in BSS for mixing signals. For example, [44] proposes a three-way architecture that incorporates a pre-trained dual-path recurrent neural network and Transformer. A Transformer network-based planewave domain masking approach is utilized to retrieve the reverberant ambisonic signals from a multichannel recording in [45].…”
Section: Introductionmentioning
confidence: 99%