2019
DOI: 10.3389/fcomm.2019.00064
|View full text |Cite
|
Sign up to set email alerts
|

Dialect Classification From a Single Sonorant Sound Using Deep Neural Networks

Abstract: During spoken communication, the fine acoustic properties of human speech can reveal vital sociolinguistic and linguistic information about speakers and thus, these properties can function as reliable identification markers of speakers' identity. One key piece of information speech reveals is speakers' dialect. The first aim of this study is to provide a machine learning method that can distinguish the dialect from acoustic productions of sonorant sounds. The second aim is to determine the classification accur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 49 publications
0
6
0
Order By: Relevance
“…The onset and offset of the first two vowel formants (F1 and F2) and the fundamental frequency (F0) were employed for the identification of vowels [ 72 , 73 ]. The onset and offset of frication (i.e., the noisy portion) was employed for the identification of fricatives [ 74 , 75 ]. Stop consonants were measured at the onset of the closure phase, including the burst [ 76 ].…”
Section: Methodsmentioning
confidence: 99%
“…The onset and offset of the first two vowel formants (F1 and F2) and the fundamental frequency (F0) were employed for the identification of vowels [ 72 , 73 ]. The onset and offset of frication (i.e., the noisy portion) was employed for the identification of fricatives [ 74 , 75 ]. Stop consonants were measured at the onset of the closure phase, including the burst [ 76 ].…”
Section: Methodsmentioning
confidence: 99%
“…Additional tools incorporate IPA transcription and acoustic analysis tools. Open Brain AI enables end-to-end spoken and written speech production analysis by combining the different computational pipelines to provide automated and objective linguistic measures (1,16,18,19,24,26,31,32).…”
Section: Technologiesmentioning
confidence: 99%
“…For example, Themistocleous, Webster (19) analyzed connected speech productions from 52 individuals with PPA using a morphological tagger and showed differences in POS production in patients with non-fluent Primary Progressive Aphasia (nfvPPA), logopenic variant of Primary Progressive Aphasia (lvPPA), and the semantic variant of Primary Progressive Aphasia (svPPA). Also, we have employed machine learning to identify speakers with different dialects, from speech acoustics, namely prosody (20)(21)(22), vowels (23,24), and consonants (21,(25)(26)(27)(28). Machine learning was used to track the learning of L1 dialectal learners of the standard language variety in classrooms (29), showing the implication of using machine learning applications in diverse populations.…”
Section: Introductionmentioning
confidence: 99%
“…Finally, Themistocleous (2019) describes classification approaches based on deep neural networks for distinguishing two Greek dialects, namely Athenian Greek, the prototypical form of Standard Modern Greek and Cypriot Greek. That work is based on the acoustic features of spoken language.…”
Section: Background and Related Workmentioning
confidence: 99%