2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763)
DOI: 10.1109/icme.2004.1394650
|View full text |Cite
|
Sign up to set email alerts
|

Automatic pronunciation assessment for Mandarin Chinese

Abstract: This paper presents the algorithms used in a prototypical software system for automatic pronunciation assessment of Mandarin Chinese. The system uses forced alignment of HMM (Hidden Markov Models) to identify each syllable and the corresponding log probability for phoneme assessment, through a ranking-based confidence measure. The pitch vector of each syllable is then sent to a GMM (Gaussian Mixture Model) for tone recognition and assessment. We also compute the similarity of scores for intensity and rhythm be… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(9 citation statements)
references
References 8 publications
0
9
0
Order By: Relevance
“…Therefore, the mandarin automatic pronunciation quality assessment system includes three subsystems, and the block diagram is shown in figure 1. For the tone, we use a pre-trained Gaussian Mixture Model (GMM) classifier based on tone recognition technology to assess the tone accuracy, and its detail can be found in [12] [13]. For consonants and vowels, we use the traditional ASR techniques of Hidden Markov Model (HMM) and Viterbi search, which is the kernel of the system.…”
Section: Figure 1 Mandarin Pronunciation Assessment System Diagrammentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore, the mandarin automatic pronunciation quality assessment system includes three subsystems, and the block diagram is shown in figure 1. For the tone, we use a pre-trained Gaussian Mixture Model (GMM) classifier based on tone recognition technology to assess the tone accuracy, and its detail can be found in [12] [13]. For consonants and vowels, we use the traditional ASR techniques of Hidden Markov Model (HMM) and Viterbi search, which is the kernel of the system.…”
Section: Figure 1 Mandarin Pronunciation Assessment System Diagrammentioning
confidence: 99%
“…Therefore, the performance of pronunciation quality assessment system mainly depends on how precise is the confidence computed for the speech segments. Many researches about CALL systems based on ASR framework focus on the pronunciation quality assessment of speech segment [5][6] [7][8] [9] [11] [12] [13], and seek to explore a more efficient confidence of pronunciation quality. So far, the confidences include log likelihood score, log posteriori probability, segmental classification score, segmental duration score, fluency score, and so on.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The first two parts are evaluated using the ASR techniques of HMM and Viterbi search [7], which is the core of the system. The tone quality is evaluated using a Gaussian Mixture Model (GMM) classifier, which is not the focus of this paper, but can be found in [8] [10]. A block diagram of the system is shown in Fig.1.…”
Section: Call System Overviewmentioning
confidence: 99%
“…The computer Aided Language Learning (CALL) system [1]- [8]can automatically score the quality of human speech ,and provide continuous feedback to learners without requiring the sole attention from the teacher. So it facilitates self study and encourage interactive use of the language rather than rote learning.…”
Section: Introductionmentioning
confidence: 99%