1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258) 1999
DOI: 10.1109/icassp.1999.758125
|View full text |Cite
|
Sign up to set email alerts
|

Improving a GMM speaker verification system by phonetic weighting

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
26
0

Year Published

2000
2000
2011
2011

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(29 citation statements)
references
References 3 publications
3
26
0
Order By: Relevance
“…However, we need to fix the length of the test segment to be equal for all classes for fair comparison. Previous studies have fixed the length of the test segment [1], [3] or showed the results as an average likelihood versus the number of frames in each phoneme [2] for a similar reason. Thus, we also perform speaker identification test with a fixed test length.…”
Section: Controlled Condition: Same Test Lengthmentioning
confidence: 99%
See 1 more Smart Citation
“…However, we need to fix the length of the test segment to be equal for all classes for fair comparison. Previous studies have fixed the length of the test segment [1], [3] or showed the results as an average likelihood versus the number of frames in each phoneme [2] for a similar reason. Thus, we also perform speaker identification test with a fixed test length.…”
Section: Controlled Condition: Same Test Lengthmentioning
confidence: 99%
“…Fundamentally, they use the fact that some classes of phonemes, such as vowels and nasals, include more speaker-related information than obstruents such as stops, affricatives, and fricatives [1]. One of the approaches assigns different weights to each phoneme depending on how much each phoneme includes speaker related information [2]. Pelecanos et al found an optimal feature type and a frame length of each phoneme for speaker identification [3].…”
Section: Introductionmentioning
confidence: 99%
“…[4][5][6][7] In this paper, we also propose a method that utilizes phoneme information to improve speaker recognition performance. While previous studies focus on using separate models for each phoneme and combining scores, 4,6,7 we focus on finding an optimal phoneme class ratio, the portion of each phoneme class in an utterance, that maximizes speaker recognition performance based on mutual information. In speaker recognition, some researchers use this measurement to measure or improve speaker recognition accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…The Universal Background Model (UBM) approach [9] is the most popular normalized likelihood approach when utilizing Gaussian Mixture Model (GMM) classifier.…”
Section: Likelihood Normalizationmentioning
confidence: 99%
“…The background models were then adapted to speaker models using MAP adaptation [9]. The first two utterances for all speakers in the corpus being common were used for text dependent experiments and 6 different utterances for each speaker allowed text independent verification experiments to be conducted.…”
Section: Likelihood Normalizationmentioning
confidence: 99%