Interspeech 2016 2016
DOI: 10.21437/interspeech.2016-717
|View full text |Cite
|
Sign up to set email alerts
|

DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification

Abstract: The importance of the phase information of speech signal is gathering attention. Many researches indicate system combination of the amplitude and phase features is effective for improving speaker recognition performance under noisy environments. On the other hand, speech enhancement approach is taken usually to reduce the influence of noises. However, this approach only enhances the amplitude spectrum, therefor noisy phase spectrum is used for reconstructing the estimated signal. Recent years, DNN based featur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
14
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
6
2
2

Relationship

1
9

Authors

Journals

citations
Cited by 27 publications
(14 citation statements)
references
References 10 publications
0
14
0
Order By: Relevance
“…To compensate for these adverse impacts, various approaches have been proposed at different stages of the ASV systems. At the signal level, DNN based speech or feature enhancement [4,5,6,7] has been investigated for ASV under complex environment. At the feature level, feature normalization techniques [8] and noise-robust features such as power-normalized cepstral coefficients (PNCC) [9] have also been applied to ASV systems.…”
Section: Introductionmentioning
confidence: 99%
“…To compensate for these adverse impacts, various approaches have been proposed at different stages of the ASV systems. At the signal level, DNN based speech or feature enhancement [4,5,6,7] has been investigated for ASV under complex environment. At the feature level, feature normalization techniques [8] and noise-robust features such as power-normalized cepstral coefficients (PNCC) [9] have also been applied to ASV systems.…”
Section: Introductionmentioning
confidence: 99%
“…At the signal level, linear prediction inverse modulation transfer function [7] and weighted prediction error (WPE) [8,9] methods have been used for dereverberation. DNN based denoising methods for single-channel speech enhancement [10,11,12,13] and beamforming for multi-channel speech enhancement [8,14,15] have also been explored for ASV system under complex environments. At the feature level, sub-band Hilbert envelopes based features [16,17,18], warped minimum variance distortionless response (MVDR) cepstral coefficients [19], blind spectral weighting (BSW) based features [17], power-normalized cepstral coefficients (PNCC) [20] and DNN bottleneck features [21] have been applied to ASV system to suppress the adverse impacts of reverberation and noise.…”
Section: Introductionmentioning
confidence: 99%
“…We also thank Weixiang Hu, Yu Lu, Zexin Liu and Lei Miao from Huawei Digital Technologies Co., Ltd, China. been proposed at different stages of the speaker recognition system. At the signal level, dereverberation [6], denoising [7,8,9,10], and beamforming [11,12] can be used for speech enhancement. At feature level, sub-band Hilbert envelopes based features [13,14], warped minimum variance distortionless response (MVDR) cepstral coefficients [15], blind spectral weighting (BSW) based features [16] have been applied to ASV system to suppress the adverse impacts of reverberation and noise.…”
Section: Introductionmentioning
confidence: 99%