2020
DOI: 10.11591/ijeecs.v19.i1.pp207-214
|View full text |Cite
|
Sign up to set email alerts
|

Speaker ethnic identification for continuous speech in malay language using pitch and MFCC

Abstract: <span>Voice recognition has evolved exponentially over the years. The purpose of voice recognition or sometimes called speaker identification, is to identify the person who is speaking. This can be done by extracting features of speech that differ between individuals due to physiology (shape and size of the mouth and throat) and also behavioral patterns (pitch, accent and style of speaking). This paper explains an approach of voice recognition to identify the ethnicity of Malaysian people. Pitch and 13 M… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 13 publications
0
4
0
Order By: Relevance
“…Accuracy is the ratio of the number of cases that were predicted with correct answers compared to the total number of cases, while recall is the ratio of the number of positive cases that were predicted correctly to the number of positive cases that were predicted. Accuracy, precision, recall and F1-score measure can be calculated using the (6) to (9), respectively. Where TP, TN, FP, and FN stand for true positive, true negative, false positive, and false negative, respectively.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Accuracy is the ratio of the number of cases that were predicted with correct answers compared to the total number of cases, while recall is the ratio of the number of positive cases that were predicted correctly to the number of positive cases that were predicted. Accuracy, precision, recall and F1-score measure can be calculated using the (6) to (9), respectively. Where TP, TN, FP, and FN stand for true positive, true negative, false positive, and false negative, respectively.…”
Section: Discussionmentioning
confidence: 99%
“…In several studies regarding speaker recognition multi ethnic, the problem of limited data is an initial challenge in conducting research such as the study conducted by Hanifa et al [9], which only had 62 recorded data of Malaysian speakers, and the study of Cole [10] to identify speakers in South East England with 227 speakers. To solve the problem of limited data, various approaches are used, among others, by using ata augmentation (DA).…”
Section: Introductionmentioning
confidence: 99%
“…This accuracy is higher than the traditional GMM. Another approach was employed in a study ( Hanifa, Isa & Mohamad, 2020 ) that used 52 recordings of Malaysian recorded samples utilizing the MFCC in the feature extraction, with an accuracy of 57%. Along with machine learning, numerous works in SER and multi-racial utilize the DL technique, regarded as a rigorous approach to SER.…”
Section: Related Workmentioning
confidence: 99%
“…It is a part of biometrics that might be utilized for distinguishing proof, check, and recognition of individual speakers, with the capacity of detection, tracking, and segmentation by Indonesian J Elec Eng & Comp Sci ISSN: 2502-4752  Arabic speaker recognition using HMM (Jabbar S. Hussein) 1213 extension. Speaker recognition and speaker check structure a bigger control of speaker classification [13]. Speaker recognition attempts to figure out which speaker produced a discourse signal though speaker check affirms if the part of the discourse has a place with the person who allegation it.…”
Section: Introductionmentioning
confidence: 99%