2021
DOI: 10.14203/jet.v21.19-26
|View full text |Cite
|
Sign up to set email alerts
|

Speech Enhancement Using Deep Learning Methods: A Review

Abstract: Speech enhancement, which aims to recover the clean speech of the corrupted signal, plays an important role in the digital speech signal processing. According to the type of degradation and noise in the speech signal, approaches to speech enhancement vary. Thus, the research topic remains challenging in practice, specifically when dealing with highly non-stationary noise and reverberation. Recent advance of deep learning technologies has provided great support for the progress in speech enhancement research fi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 25 publications
(7 citation statements)
references
References 43 publications
0
7
0
Order By: Relevance
“…Generally, other evaluations of systems that perform speech separation or speech enhancements have been performed using simulated databases [72], which makes it more difficult to assess the extent to which these approaches can be applied in the real world. Moreover, it is even more difficult to find studies that assess the performance of the models under real dynamic conditions such as in mobile HRI scenarios.…”
Section: Human-robot Interactionmentioning
confidence: 99%
“…Generally, other evaluations of systems that perform speech separation or speech enhancements have been performed using simulated databases [72], which makes it more difficult to assess the extent to which these approaches can be applied in the real world. Moreover, it is even more difficult to find studies that assess the performance of the models under real dynamic conditions such as in mobile HRI scenarios.…”
Section: Human-robot Interactionmentioning
confidence: 99%
“…DNNs have shown better performance at modeling complex nonlinearity and perform better on SE in severe noisy backgrounds that are highly non-stationary. SE has made large progress due to the recent growth of DNNs [15]. The study [16] compressed the models to reduce DNN size, regressionbased DNNs [17] are proposed for SE, DNN-based SE is proposed to examine performance across different speech datasets [18], time-frequency-based training objective is proposed for SE [19], and generalized gaussian distributions are VOLUME 4, 2016 proposed for SE using regression-based DNN [20].…”
Section: Introductionmentioning
confidence: 99%
“…Deep learning was defined by [5] as models composed of multiple processing layers that learn representations of data with various levels of abstraction. These models have shown an exemplary performance in speech enhancement [6].…”
Section: Introductionmentioning
confidence: 99%