2023
DOI: 10.13064/ksss.2023.15.2.043
|View full text |Cite
|
Sign up to set email alerts
|

Dialect classification based on the speed and the pause of speech utterances*

Abstract: In this paper, we propose an approach for dialect classification based on the speed and pause of speech utterances as well as the age and gender of the speakers. Dialect classification is one of the important techniques for speech analysis. For example, an accurate dialect classification model can potentially improve the performance of speaker or speech recognition. According to previous studies, research based on deep learning using Mel-Frequency Cepstral Coefficients (MFCC) features has been the dominant app… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 34 publications
0
1
0
Order By: Relevance
“…Backend algorithms show better performance when training and test utterance lengths are equal for longer speech segments, and using medium-length utterances for training and longer ones for testing works well for female datasets. A study was conducted by Jonghwan Na․ and Bowon Lee (Na & Lee, 2023) introduced a novel approach for dialect classification in speech analysis. The method incorporates acoustic differences between regions, extracting additional features like speech speed, pauses, age, and gender of speakers.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Backend algorithms show better performance when training and test utterance lengths are equal for longer speech segments, and using medium-length utterances for training and longer ones for testing works well for female datasets. A study was conducted by Jonghwan Na․ and Bowon Lee (Na & Lee, 2023) introduced a novel approach for dialect classification in speech analysis. The method incorporates acoustic differences between regions, extracting additional features like speech speed, pauses, age, and gender of speakers.…”
Section: Literature Reviewmentioning
confidence: 99%