2014
DOI: 10.1109/taslp.2014.2354236

STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement

Abstract: The enhancement of speech which is corrupted by noise is commonly performed in the short-time discrete Fourier transform domain. In case only a single microphone signal is available, typically only the spectral amplitude is modified. However, it has recently been shown that an improved spectral phase can as well be utilized for speech enhancement, e.g., for phase-sensitive amplitude estimation. In this paper, we therefore present a method to reconstruct the spectral phase of voiced speech from only the fundame…

Cited by 186 publications (163 citation statements)
References 27 publications
“…A summary of each of the 16 primary systems is provided in Table III and discussed below. [42], baseband phase difference (BPD) [43], and pitch synchronous phase (PSP). A multilayer perceptron (MLP) was trained for each feature.…”
Section: B Submissions (mentioning)
confidence: 99%
“…• BPD: Baseband phase difference [20] is a phase feature extracted from baseband STFT, which can also provide a clear pattern to present phase information.…”
Section: Feature Extraction (mentioning)
confidence: 99%
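
The statement above describes BPD only in words. As a rough illustration, the sketch below assumes the commonly used definition: the frame-to-frame STFT phase difference, minus the phase advance expected at each bin's center frequency, wrapped to the principal value. The helper names (`stft`, `baseband_phase_difference`) and all signal parameters are hypothetical and are not taken from the citing paper.

```python
import numpy as np

def stft(x, n_fft, hop):
    """Hann-windowed STFT, shape (n_fft // 2 + 1, n_frames). Hypothetical helper."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[l * hop : l * hop + n_fft] * window for l in range(n_frames)], axis=1
    )
    return np.fft.rfft(frames, axis=0)

def baseband_phase_difference(spec, n_fft, hop):
    """Assumed BPD definition: temporal phase difference with the advance
    of each bin's center frequency removed, wrapped to (-pi, pi]."""
    phase = np.angle(spec)
    k = np.arange(spec.shape[0])[:, None]          # bin indices as a column
    delta = np.diff(phase, axis=1) - 2 * np.pi * k * hop / n_fft
    return np.angle(np.exp(1j * delta))            # principal value

# Toy input: a stationary 440 Hz cosine (all values are illustrative).
fs, f0, n_fft, hop = 16000, 440.0, 512, 128
t = np.arange(fs) / fs
x = np.cos(2 * np.pi * f0 * t)

spec = stft(x, n_fft, hop)
bpd = baseband_phase_difference(spec, n_fft, hop)

# In the bin nearest f0, the BPD of a stationary sinusoid should be roughly
# constant and proportional to the offset between f0 and the bin center.
k0 = int(round(f0 * n_fft / fs))
expected = 2 * np.pi * hop * (f0 / fs - k0 / n_fft)
```

Under these assumptions, `bpd[k0]` stays close to `expected` in every frame, which is one way to read the "clear pattern" mentioned in the statement: harmonic components show up as nearly constant values along time.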
“…Alternatively, one can extract phase constraints from the sinusoidal model, which is widely used for representing audio signals [11,22]. It can be shown [23] that the STFT phase µ of a signal modeled as a sum of sinusoids in the time domain follows the phase unwrapping (PU) equation:…”
Section: Sinusoidal Model (mentioning)
confidence: 99%
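
The quoted statement breaks off before the equation, and the sketch below does not attempt to reproduce it. It only illustrates the standard relation for a single stationary sinusoid: between consecutive frames, the STFT phase in the bin nearest the sinusoid advances by 2π·f0·hop/fs. The function names and all parameter values are hypothetical.

```python
import numpy as np

def wrap(phase):
    """Wrap phase values to the principal interval (-pi, pi]."""
    return np.angle(np.exp(1j * phase))

def stft_phase_at_bin(x, n_fft, hop, k):
    """Phase of STFT bin k in every frame (Hann window, no padding)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    phases = np.empty(n_frames)
    for l in range(n_frames):
        frame = x[l * hop : l * hop + n_fft] * window
        phases[l] = np.angle(np.fft.rfft(frame)[k])
    return phases

# Toy signal: one stationary sinusoid (illustrative values only).
fs, f0, n_fft, hop = 16000, 440.0, 512, 128
t = np.arange(fs) / fs
x = np.cos(2 * np.pi * f0 * t + 0.3)

k = int(round(f0 * n_fft / fs))                    # bin nearest to f0
measured = stft_phase_at_bin(x, n_fft, hop, k)

# Propagate the phase of the first frame forward using only f0.
advance = 2 * np.pi * f0 * hop / fs
predicted = wrap(measured[0] + advance * np.arange(len(measured)))

# Compare on the unit circle so that wrap-around does not inflate the error.
max_error = np.max(np.abs(wrap(measured - predicted)))
```

With a Hann window the leakage from the negative-frequency component is small, so `max_error` should stay far below one radian. For real speech the frequency changes over time, so a fixed `advance` is only a local approximation.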
“…It has been used in many audio applications, including time stretching [23], speech enhancement [22] and source separation [7,11,24].…”
Section: Sinusoidal Model (mentioning)
confidence: 99%