2016
DOI: 10.1186/s13634-015-0300-4
|View full text |Cite
|
Sign up to set email alerts
|

Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation

Abstract: This paper investigates deep neural networks (DNN) based on nonlinear feature mapping and statistical linear feature adaptation approaches for reducing reverberation in speech signals. In the nonlinear feature mapping approach, DNN is trained from parallel clean/distorted speech corpus to map reverberant and noisy speech coefficients (such as log magnitude spectrum) to the underlying clean speech coefficients. The constraint imposed by dynamic features (i.e., the time derivatives of the speech coefficients) ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
32
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 41 publications
(32 citation statements)
references
References 46 publications
0
32
0
Order By: Relevance
“…In previous studies, traditional fully connected DNNs were used to perform dereverberation [21][22][23]. More recently, the highway strat- egy has been popularly used and shown to provide improved performance [29].…”
Section: Highway-ddae Dereverberation Systemmentioning
confidence: 99%
See 2 more Smart Citations
“…In previous studies, traditional fully connected DNNs were used to perform dereverberation [21][22][23]. More recently, the highway strat- egy has been popularly used and shown to provide improved performance [29].…”
Section: Highway-ddae Dereverberation Systemmentioning
confidence: 99%
“…Channel-inversion methods belong to the third category, which considers the reverberation as a convolution of the original sound with the room impulse response (RIR) and thereby performs an inverse filtering to deconvolve the captured signal [15][16][17][18][19][20]. Even though the above three categories of approaches have been shown to provide satisfactory performance, they usually require an accurate estimation of time-varied RIR, which may not always be accessible in practice [21].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…These methods process the reverberant signal by estimating the magnitude spectrogram of the late reverberation and remove it from the magnitude spectrogram of the input signal. Suppression methods are based on a wide variety of approaches, such as a stochastic model of the RIR [5], a linear prediction model of the speech [6], a non-negative matrix factorization approach [7], or more recently on deep neural networks [8].…”
Section: Introductionmentioning
confidence: 99%
“…This is the main drawback of these methods, because using this corrupted phase reintroduces reverberation and distortion in the signal, as shown in [8]. In the source separation literature, the idea of modeling the phase has recently been proposed [9] because a similar problem occurs (the phase of the mixture is generally used to synthesize the source signals), but not for dereverberation.…”
Section: Introductionmentioning
confidence: 99%