2019
DOI: 10.1109/taslp.2019.2934319
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning for Talker-Dependent Reverberant Speaker Separation: An Empirical Study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
12
0

Year Published

2020
2020
2025
2025

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 34 publications
0
12
0
Order By: Relevance
“…Dereverberation algorithms such as the weighted prediction error (WPE) method [16] are crucial for good performance. Systems for combined separation and dereverberation of a target speaker have been proposed in [17,18]. A recurrent neural network for joint separation and dereverberation is trained to learn the clean target signal in the time-frequency domain from the reverberant mixture [17].…”
Section: Introductionmentioning
confidence: 99%
“…Dereverberation algorithms such as the weighted prediction error (WPE) method [16] are crucial for good performance. Systems for combined separation and dereverberation of a target speaker have been proposed in [17,18]. A recurrent neural network for joint separation and dereverberation is trained to learn the clean target signal in the time-frequency domain from the reverberant mixture [17].…”
Section: Introductionmentioning
confidence: 99%
“…The main motivation is that jointly separating and enhancing may be too difficult for a single network to learn, and modularization may allow the networks to focus on specific tasks. Two-stage approaches have previously been explored for denoising plus dereverberation [19,20], separation plus dereverberation [21], and denoising plus separation [11].…”
Section: Cascaded Modelsmentioning
confidence: 99%
“…When W is fixed, each element of S j will be fixed at s j (f, n) = w H j (f )x(f, n). Now, since the terms that depend on z j and c j in (16) are given as…”
Section: Fmvae Algorithm a Ideamentioning
confidence: 99%
“…Meanwhile, given the recent advances achieved by deep VOLUME 4, 2016 neural network (DNN)-based speaker separation methods, including deep clustering (DC) [9], [10] and permutation invariant training (PIT) [11], [12], a discriminative approach has recently proved powerful in monaural source separation tasks, including both speaker-dependent and -independent scenarios [13]- [16]. The general idea is to train a DNN that predicts TF masks or TF embeddings from a given mixture signal based on spectro-temporal features.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation