2009 Canadian Conference on Electrical and Computer Engineering 2009
DOI: 10.1109/ccece.2009.5090157
|View full text |Cite
|
Sign up to set email alerts
|

Distributed automatic text-independent speaker identification using GMM-UBM speaker models

Abstract: The ETSI "Aurora" is a digit-based standard developed for distributed speech recognition (DSR) over telephone communication channels. This paper introduces a digitbased text-independent distributed speaker identification (DSID) system over telephone channels within the DSR framework. In this DSID system, the hypothesized speaker model is derived by GMM-UBM model training using Aurora2 connected digit training speech data and maximum a posteriori (MAP) adaptation. The UBM technique for speaker models is incorpo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 2 publications
0
5
0
Order By: Relevance
“…The rapid operation performance of the PSO-FFNN model has been reported since the FFNN model will be totally relying on the PSO algorithm for producing the weight coefficients without the need of performing standalone (internal) weight generation. Eventually, the mean square error (MSE) and root mean square error (RMSE) metrics are also found to be minimal [30][31][32]. MSE and RMSE metrics imply lesser error existence on the proposed model predictions.…”
Section: Resultsmentioning
confidence: 99%
“…The rapid operation performance of the PSO-FFNN model has been reported since the FFNN model will be totally relying on the PSO algorithm for producing the weight coefficients without the need of performing standalone (internal) weight generation. Eventually, the mean square error (MSE) and root mean square error (RMSE) metrics are also found to be minimal [30][31][32]. MSE and RMSE metrics imply lesser error existence on the proposed model predictions.…”
Section: Resultsmentioning
confidence: 99%
“…Kmeans algorithm and LBG algorithm belong to a sort of local clustering arithmetic, and the cluster result is always not global optimum under the condition of the abundant data and the fixed-setting classification number. The parameter model trained based on the cluster result has great effect on the identifying rate of the speaker identification system [4,7]. For this reason, an ant colony algorithm combined with Genetic arithmetic is proposed in the paper to improve the identifying rate.…”
Section: A Choice Of Methodsmentioning
confidence: 99%
“…[1,3]. The voiceprint recognition application of Gaussian Mixture Model (GMM) is becoming more and more popular after the 1990s because of its easiness, agility, availability and high robustness [4,14]. The voiceprint recognition technology is improved in the initialization method for the speaker model parameters training based on expectation maximization (EM) in order to raise its recognition rate in the paper.…”
Section: Introductionmentioning
confidence: 99%
“…The hybrid model [18] has been developed recently for speaker identification, in which two techniques, i.e., Gaussian Mixture Model (GMM) [19] and k-means clustering [20], are combined. This hybrid model can derive the precise model based on GMM extracted features for the particular speaker data, and it has been discovering the speaker clusters.…”
Section: Literature Studymentioning
confidence: 99%