2020
DOI: 10.1109/access.2020.3019495
|View full text |Cite
|
Sign up to set email alerts
|

Speech Source Separation Using Variational Autoencoder and Bandpass Filter

Abstract: Speech source separation is essential for speech-related applications because this process enhances the input speech signal for the main processing model. Most of the current approaches for this task focus on separating the speech of commonly high-frequency noises or a particular background sound. They cannot clear the signals which intersect with the human speech in its frequency range. To deal with this problem, we propose a hybrid approach combining a variational autoencoder (VAE) and a bandpass filter (BPF… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(6 citation statements)
references
References 31 publications
0
4
0
Order By: Relevance
“…In [ 116 ], the researchers used two data sets, TIMIT and VIVOS. TIMIT is a speech corpus for American English; it has 8 dialects and 630 speakers.…”
Section: Applicationsmentioning
confidence: 99%
“…In [ 116 ], the researchers used two data sets, TIMIT and VIVOS. TIMIT is a speech corpus for American English; it has 8 dialects and 630 speakers.…”
Section: Applicationsmentioning
confidence: 99%
“…In signal and speech processing, monaural speech source separation is challenging because it separates the target speaker from the mixture of speakers and the background noises and interferences in a single microphone recording. Speaker separation [10]- [13], speech enhancement [14], [15], and speech de-reverberation and denoising [16] come under single-channel source separation categories, as in Fig. 1.…”
Section: Introductionmentioning
confidence: 99%
“…Speaker separation allows extracting more than one speaker from the mixture of two and more than two speakers [10]- [13]. Speech enhancement improves noisy speech signals' intelligibility and perceptual quality [14], [15] and attempts to separate speech from noisy mixture signals.…”
Section: Introductionmentioning
confidence: 99%
“…B LIND source separation (BSS) refers to the process of separating and recovering signals by using only the observed signals when the source signals and the transmission channel are unknown [1]. After years of research and development, BSS has been successfully applied in many fields, including military communications [2], image processing [3], speech signal processing [4], and other fields. BSS is defined as underdetermined blind source separation (UBSS) when the number of observed signals is less than the number of source signals [5].…”
Section: Introductionmentioning
confidence: 99%