IEEE International Conference on Acoustics Speech and Signal Processing 2002
DOI: 10.1109/icassp.2002.5743809
|View full text |Cite
|
Sign up to set email alerts
|

Short-time Gaussianization for robust speaker verification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
52
0
1

Year Published

2006
2006
2013
2013

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 91 publications
(53 citation statements)
references
References 5 publications
0
52
0
1
Order By: Relevance
“…For telephone speech, the VAD labels are produced by a Hungarian phoneme recognizer [33,34] and for microphone speech, VAD labels are generated using a GMM-based VAD by training one GMM for nonspeech and another one for speech [35]. Final features are obtained after appending the delta and double delta features and normalizing the features using a short-time Gaussianization (STG) method [24,40].…”
Section: Multi-taper Mfcc and Plp Feature Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…For telephone speech, the VAD labels are produced by a Hungarian phoneme recognizer [33,34] and for microphone speech, VAD labels are generated using a GMM-based VAD by training one GMM for nonspeech and another one for speech [35]. Final features are obtained after appending the delta and double delta features and normalizing the features using a short-time Gaussianization (STG) method [24,40].…”
Section: Multi-taper Mfcc and Plp Feature Extractionmentioning
confidence: 99%
“…Nonspeech frames are then removed using pre-computed VAD labels using algorithms mentioned in section 3. For feature normalization, we apply the short-time Gaussianization (STG) technique [24,40] over a 300-frame window.…”
Section: Feature Extractionmentioning
confidence: 99%
“…For the acoustic GMM-UBM system [1], we applied several channel compensation techniques, including feature warping [40], Z-norm [41], short-time Gaussianization (STG) [42] and fast blind stochastic feature transformation (fBSFT) [43]. Acoustic scores S GMM-UBM were computed based on the loglikelihood ratio:…”
Section: Scoring Methodsmentioning
confidence: 99%
“…Feature warping has been shown to be robust to different channel and noise effects. In short-time Gaussianization [10], a linear transformation is applied to the distorted feature before mapping them to a normal distribution. The transformation aims to decorrelate the feature vectors, making them more amendable to diagonal covariance GMMs.…”
Section: Blind Compensationmentioning
confidence: 99%