2014
DOI: 10.1109/taslp.2014.2355821
|View full text |Cite
|
Sign up to set email alerts
|

On speech features fusion, α-integration Gaussian modeling and multi-style training for noise robust speaker classification

Abstract: This paper investigates the fusion of Mel-frequency cepstral coefficients (MFCC) and statistical pH features to improve the performance of speaker verification (SV) in non-stationary noise conditions. The -integrated Gaussian Mixture Model ( -GMM) classifier is adopted for speaker modeling. Two different approaches are applied to reduce the effects of noise corruption in the SV task: speech enhancement and multi-style training (MT). The spectral subtraction with minimum statistics (MS/SS) and the optimally-mod… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
11
0
5

Year Published

2015
2015
2019
2019

Publication Types

Select...
8
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 24 publications
(18 citation statements)
references
References 36 publications
2
11
0
5
Order By: Relevance
“…In the area of GMM-based speaker [14] and emotional voice classification [15] analysis in our experiment arises from the previous research [16,17]. First, the weighted frames of the input sentence are used to calculate the energy contour from the first cepstral coefficient.…”
Section: Determination Of Speech Featuresmentioning
confidence: 99%
“…In the area of GMM-based speaker [14] and emotional voice classification [15] analysis in our experiment arises from the previous research [16,17]. First, the weighted frames of the input sentence are used to calculate the energy contour from the first cepstral coefficient.…”
Section: Determination Of Speech Featuresmentioning
confidence: 99%
“…In the area of the GMM-based speaker [28]- [30], as well as the acoustic signal recognition [31], the most commonly used spectral features are mel-frequency cepstral coefficients together with energy and prosodic parameters. In our experiments the features differ for ANOVA-based evaluation and GMM-based classification of the speech signal quality.…”
Section: Determination Of Features Of the Speech Signalmentioning
confidence: 99%
“…Coloured, white, and narrow-band noises are applied to alpha-GMM classifier for multi-style training [30]. The goal of the multi-style training is to reduce the mismatch between training and test features by corrupting the speech signals.…”
Section: Robust Features Against Additive Noisementioning
confidence: 99%