2020
DOI: 10.1007/s11831-020-09414-4
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Speech Recognition System for Tonal Languages: State-of-the-Art Survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3
1
1

Relationship

2
8

Authors

Journals

citations
Cited by 44 publications
(18 citation statements)
references
References 87 publications
0
18
0
Order By: Relevance
“…The purpose of this systematic survey is, to sum up, the best available research on automatic speech recognition of Indian languages. Kaur et al [147] reviewed the status of speech recognition research conducted on tonal languages spoken around the globe. Authors observed that a lot of work has been done for the Asian continent tonal languages, i.e., Chinese, Thai, Vietnamese, Mandarin, but little work been reported for the Mizo, Bodo, Indo-European tonal languages like Punjabi, Latvian, Lithuanian as well for the African continental tonal languages, i.e., Hausa and Yoruba (Tables 1, 2).…”
Section: Inclusion/exclusion Criteriamentioning
confidence: 99%
“…The purpose of this systematic survey is, to sum up, the best available research on automatic speech recognition of Indian languages. Kaur et al [147] reviewed the status of speech recognition research conducted on tonal languages spoken around the globe. Authors observed that a lot of work has been done for the Asian continent tonal languages, i.e., Chinese, Thai, Vietnamese, Mandarin, but little work been reported for the Mizo, Bodo, Indo-European tonal languages like Punjabi, Latvian, Lithuanian as well for the African continental tonal languages, i.e., Hausa and Yoruba (Tables 1, 2).…”
Section: Inclusion/exclusion Criteriamentioning
confidence: 99%
“…Alternatively, biometrics, recognising individuals from behavior and biological characteristics, 6 gained attention for authentication. This is attributed to biometric sensors included in smartphones, such as fingerprint [139] (pore and ridge structure [256]), voice [48] (mel frequency cepstral coding, today deep neural networks [144]), gait (heel-strike ratio [237] or cycle matching [200]), face (features learned in deep neural networks [134]), keystroke dynamics (key-press latencies [195]), or iris [290] (image intensity maps from Hough-transformed Daugman rubber sheet models [281]).…”
Section: Limitations Of Traditional Authentication Schemesmentioning
confidence: 99%
“…Many systems treat pitch (the main phonetic cue of tone) as a completely separate feature. In such systems, the traditional ASR algorithm learns the segments, and a separate machine learning module learns the pitch patterns and offers its inference of the tone (Kaur et al, 2020). This has been used for languages like Mandarin (Niu et al, 2013;Shan et al, 2010), Thai (Kertkeidkachorn et al, 2014) and Yoruba (O .…”
Section: Tonal Languages and Asrmentioning
confidence: 99%