2018
DOI: 10.1007/s11265-018-1334-2
|View full text |Cite
|
Sign up to set email alerts
|

Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
5
0

Year Published

2018
2018
2025
2025

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 19 publications
(7 citation statements)
references
References 33 publications
2
5
0
Order By: Relevance
“…The experimental results have shown that at full data resolution, F0 contours both in Hertz and in semitones can achieve high tone recognition rates (86% and 97%, respectively). Although similar recognition rates were already shown in a previous study using the same corpus [29], the performance is not trivial, as these tones were produced in fluent connected speech in many different tonal contexts and two syllable positions [28], yet in the present study no contextual or positional information is used as input features during training and testing, contrary to the common practice in speech technology applications [30], [31] and [32]. This means that, despite the variability, tones produced in contexts by speakers of both genders still have enough in common to allow a pattern recognition algorithm (SVM) to accurately recognize the tonal categories.…”
Section: Discussionsupporting
confidence: 63%
“…The experimental results have shown that at full data resolution, F0 contours both in Hertz and in semitones can achieve high tone recognition rates (86% and 97%, respectively). Although similar recognition rates were already shown in a previous study using the same corpus [29], the performance is not trivial, as these tones were produced in fluent connected speech in many different tonal contexts and two syllable positions [28], yet in the present study no contextual or positional information is used as input features during training and testing, contrary to the common practice in speech technology applications [30], [31] and [32]. This means that, despite the variability, tones produced in contexts by speakers of both genders still have enough in common to allow a pattern recognition algorithm (SVM) to accurately recognize the tonal categories.…”
Section: Discussionsupporting
confidence: 63%
“…. Previous results showed that this method had many advantages in mispronunciation detection [30], which attested the reliability of the grouping method mentioned above.…”
Section: Datasupporting
confidence: 54%
“…Motivated by the success of deep learning technology, some deep learning models have been applied to tone recognition. [7,8] applies DNN to tone recognition on female corpus and some good results are achieved. More recently, [9] employs Convolutional Neural Network (CNN) for speech evaluation of the hearing-impaired population.…”
Section: Introductionmentioning
confidence: 99%