2013 International Conference on Human Computer Interactions (ICHCI) 2013
DOI: 10.1109/ichci-ieee.2013.6887781
|View full text |Cite
|
Sign up to set email alerts
|

Text independent speaker recognition system using GMM

Abstract: Abstract-The idea of the AUDIO SIGNAL PROCESSING (Speaker Recognition [4] Project) is to implement a recognizer using Matlab which can identify a person by processing his/her voice. The Matlab functions and scripts were all well documented and parameterized in order to be able to use them in the future. The basic goal of our project is to recognize and classify the speeches of different persons. This classification is mainly based on extracting several key features like Mel Frequency Cepstral Coefficients (MF… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 7 publications
0
2
0
Order By: Relevance
“…This is because the GMM model is characterized by the parameters related averages and variance of data also allow modeling of data distribution with optional precision. In the same way, GMM proved appropriate for the problem of recognizing information contours such as speaker recognition, dialect recognition, language identification, emotion recognition, and music genre identification [61][62][63][64][65][66][67]. On the other hand, in terms of model implementation, GMM allows for training in a much shorter time than ANN, leading to unnecessary use of complex and expensive hardware configurations including GPUs.…”
Section: Gaussian Mixture Modelmentioning
confidence: 99%
“…This is because the GMM model is characterized by the parameters related averages and variance of data also allow modeling of data distribution with optional precision. In the same way, GMM proved appropriate for the problem of recognizing information contours such as speaker recognition, dialect recognition, language identification, emotion recognition, and music genre identification [61][62][63][64][65][66][67]. On the other hand, in terms of model implementation, GMM allows for training in a much shorter time than ANN, leading to unnecessary use of complex and expensive hardware configurations including GPUs.…”
Section: Gaussian Mixture Modelmentioning
confidence: 99%
“…It comes in two forms that speaker verification [7] aims to verify if two audio samples belong to the same speaker, whereas speaker identification [58] predicts the identity of the speaker given an audio sample. Speaker recognition has been approached by applying a variety of machine learning models [4,5,12], either standard or specifically designed, to speech features such as MFCC [34]. For many years, the GMM-UBM framework [44] dominated this particular field.…”
Section: Audio-based Speaker Identificationmentioning
confidence: 99%