2021
DOI: 10.1186/s40537-021-00435-9
|View full text |Cite
|
Sign up to set email alerts
|

Development of a regional voice dataset and speaker classification based on machine learning

Abstract: At present, voice biometrics are commonly used for identification and authentication of users through their voice. Voice based services such as mobile banking, access to personal devices, and logging into social networks are the common examples of authenticating users through voice biometrics. In Pakistan, voice-based services are very common in banking and mobile/cellular sector, however, these services do not use voice features to recognize customers. Therefore, the chance to use these services with false id… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 38 publications
0
2
0
Order By: Relevance
“…Secondly, the traditional bio-inspired algorithm known as Mel Frequency Cepstral Coefficients (MFCC) is commonly used in the context of speech recognition and speaker verification especially under noisy conditions since they are noise robust and invariant both to additive and convolutional noise [34][35][36]. The idea of MFCC is to capture the most salient features based on a nonlinear frequency scale derived from the human peripheral auditory system, which would reflect how our hearing perceives the VA signal in terms of spectral power density in Mel frequency bands.…”
Section: Feature Extractionmentioning
confidence: 99%
“…Secondly, the traditional bio-inspired algorithm known as Mel Frequency Cepstral Coefficients (MFCC) is commonly used in the context of speech recognition and speaker verification especially under noisy conditions since they are noise robust and invariant both to additive and convolutional noise [34][35][36]. The idea of MFCC is to capture the most salient features based on a nonlinear frequency scale derived from the human peripheral auditory system, which would reflect how our hearing perceives the VA signal in terms of spectral power density in Mel frequency bands.…”
Section: Feature Extractionmentioning
confidence: 99%
“…The accuracy evaluation metric was used to measure the proportion of correctly classified instances in the datasets. It shows how frequently the classifier predicts the correct values and gives the percentage of the samples which were correctly classified from all the samples given [39]. It was calculated as the number of correct predictions divided by the total number of predictions.…”
Section: Accuracymentioning
confidence: 99%