2015
DOI: 10.1007/s10772-015-9295-3
|View full text |Cite
|
Sign up to set email alerts
|

i-Vectors in speech processing applications: a survey

Abstract: In the domain of speech recognition many methods have been proposed over time like Gaussian mixture models (GMM), GMM with universal background model (GMM-UBM framework), joint factor analysis, etc. i-Vector subspace modeling is one of the recent methods that has become the state of the art technique in this domain. This method largely provides the benefit of modeling both the intra-domain and inter-domain variabilities into the same low dimensional space. In this survey, we present a comprehensive collection … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0
1

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 44 publications
(22 citation statements)
references
References 83 publications
0
21
0
1
Order By: Relevance
“…On the other hand, in the SITW database, 70% from FWMFCC features was fused with 30% from the corresponding CMVNPNCC features. The Future work will consider a similar extensive evaluation for a speaker identification system built from an I-vector approach [4].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, in the SITW database, 70% from FWMFCC features was fused with 30% from the corresponding CMVNPNCC features. The Future work will consider a similar extensive evaluation for a speaker identification system built from an I-vector approach [4].…”
Section: Discussionmentioning
confidence: 99%
“…According to [4], feature extraction within speaker identification should be less influenced by noise or the person's health. However, to improve the speaker identification accuracy (SIA), Mel frequency cepstral coefficients (MFCC) features were fused with inverse MFCC features (IMFCC) in [5], but the approach was limited by the number of GMM components.…”
Section: Introductionmentioning
confidence: 99%
“…Lowdimensional i-vector w is a dense representation of all relevant discrimination information. The backend of this classifier was a simplified GPLDA algorithm [7]. We used a log likelihood ratio of probabilities of two hypotheses.…”
Section: Classifiers Gaussian Mixture Models (Baseline Classifier)mentioning
confidence: 99%
“…The main idea behind our system is the fusion of scores [4] given by a set of binary classification algorithms to obtain better classification results. We opted for some common speaker modelling techniques such as Gaussian mixture models (GMM) [5], GMM based universal background models (UBM) and i-vectors [6,7]. The set of classifiers consists of multiple GMMs based classifier [1,5], i-vector-Gaussian Probabilistic Linear Discriminant Analysis (GPLDA) [7] and highly popular in machine learning community tree boosting method XGBoost [8].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation