2016
DOI: 10.21512/comtech.v7i2.2252
|View full text |Cite
|
Sign up to set email alerts
|

Analysis And Voice Recognition In Indonesian Language Using MFCC And SVM Method

Abstract: Voice recognition technology is one of biometric technology. Sound is a unique part of the human being which made an individual can be easily distinguished one from another. Voice can also provide information such as gender, emotion, and identity of the speaker. This research will record human voices that pronounce digits between 0 and 9 with and without noise. Features of this sound recording will be extracted using Mel Frequency Cepstral Coefficient (MFCC). Mean, standard deviation, max, min, and the combina… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 3 publications
0
4
0
Order By: Relevance
“…In that case, the more delicate components of the Mel frequency spectral envelope can be represented. Still, the dimensionality of the feature vector increases, so MFCCs with less than 20 dimensions [20][21][22][23] or MFCCs and logarithmic energy are often used. 24 The MFCCs and its time-varying component Delta MFCCs, and also the time-varying component of Delta MFCCs, Delta delta MFCCs, are sometimes used.…”
Section: Compress the Amplitude Spectrum By Applying Amentioning
confidence: 99%
“…In that case, the more delicate components of the Mel frequency spectral envelope can be represented. Still, the dimensionality of the feature vector increases, so MFCCs with less than 20 dimensions [20][21][22][23] or MFCCs and logarithmic energy are often used. 24 The MFCCs and its time-varying component Delta MFCCs, and also the time-varying component of Delta MFCCs, Delta delta MFCCs, are sometimes used.…”
Section: Compress the Amplitude Spectrum By Applying Amentioning
confidence: 99%
“…Mel Frequency Cepstral Coefficients (MFCC) is one of the voice feature extraction techniques that are often used as in [9]- [12], MFCC is used to distinguish one sound from other sounds. By utilizing classification techniques, we can classify MFCC features from voice input to a class that we have specified.…”
Section: Related Workmentioning
confidence: 99%
“…However, the many variations of sound can cause the classification process to look for non-linear correlations. To solve non-linear correlations problem many researchers try to use machine learning techniques [9]- [16], and what is more promising is the classification using deep learning techniques. With this in mind, we decided to try the deep learning approach (Convolutional Neural Network) on the speech recognition module in our educational game application.…”
Section: Related Workmentioning
confidence: 99%
“…Meanwhile, Harvianto et. al [30] analyse the voice of Indonesia Language using MFCC and SVM achieved high accuracy which is 91.83%.…”
Section: Introductionmentioning
confidence: 99%