2019
DOI: 10.1134/s1064226919120155
|View full text |Cite
|
Sign up to set email alerts
|

Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 26 publications
0
5
0
Order By: Relevance
“…This paper emphasize the speech magnitude enhancement. We will be devoted to include the phase estimation [ 57 ] and incorporate with the proposed SE model. Moreover, more robust loss functions will be worked out for better results.…”
Section: Discussionmentioning
confidence: 99%
“…This paper emphasize the speech magnitude enhancement. We will be devoted to include the phase estimation [ 57 ] and incorporate with the proposed SE model. Moreover, more robust loss functions will be worked out for better results.…”
Section: Discussionmentioning
confidence: 99%
“…In the AEC system, the adaptive filter is employed to identify the room IR. The filter needs to be updated continuously, because the characteristics of the room vary in time with the movement of people and objects [4,5]. In Figure 1, 𝑠 𝑏 (𝑛) is convoluted by IR ℎ 𝑎 (𝑛) of room, and then captured, simultaneously, by a microphone with the 𝑠 𝑎 (𝑛).…”
Section: Acoustic Echo Cancellation Systemmentioning
confidence: 99%
“…Recently, several research papers have been published based on adaptive filtering algorithms, which have been implemented on several telecommunication fields, such as acoustic echo and noise reduction to accelerate the convergence adaptation and enhancing the quality of conversations [1][2][3]. Adaptive filtering algorithms are largely used in acoustic echo cancellation [4][5][6]. Several researches have been conducted to address the identification problem in the time domain and in the frequency domain [7,8].…”
Section: Introductionmentioning
confidence: 99%
“…The findings revealed that gammatone features is one of the outperform features. Various speech dataset such as IEEE [10,14,15,17] and TIMIT [18] were used to analyze the performance, which normally in short duration of speech utterance. A study in [10] also compared the training targets in DNN algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…Some studies proposed a new target mask in the DNN such as less aggressive Wiener filtering, phase aware, and complex ratio mask [20][21][22]. Other studies also proposed an alternative approach using postfiltering techniques such as global variance equalization [17,24] after the DNN-based mask estimation and speech reconstruction.…”
Section: Related Workmentioning
confidence: 99%