2011
DOI: 10.3844/jcssp.2011.1310.1317
|View full text |Cite
|
Sign up to set email alerts
|

Modeling of Fundamental Frequency Contour of Thai Expressive Speech using Fujisaki's Model and Structural Model

Abstract:

Problem statement: In spontaneous speech communication, prosody is an important factor that must be taken into account, since the prosody effects on not only the naturalness but also the intelligibility of speech. Focusing on synthesis of Thai expressive speech, a number of systems has been developed for years. However, the expressive speech with various speaking styles has not been accomplished. To achieve the generation of expressive speech, we need to model the fundamental frequenc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2012
2012
2012
2012

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…Moreover, it reveals that the RMS error of the Fujisaki's model is higher than that of the structural model for all speech styles. In other words, the structural model gives the better fit for modeling of the F 0 contour of the expressive speech than that of the Fujisaki's model (Chomphan, 2011d).…”
Section: Resultsmentioning
confidence: 99%
“…Moreover, it reveals that the RMS error of the Fujisaki's model is higher than that of the structural model for all speech styles. In other words, the structural model gives the better fit for modeling of the F 0 contour of the expressive speech than that of the Fujisaki's model (Chomphan, 2011d).…”
Section: Resultsmentioning
confidence: 99%
“…2, the following charts are summarized (Chomphan, 2011b). First, the noise effects on the male-angry-style speech are summarized in terms of RMSE values with four different types of noises and five different levels of noises in Fig.…”
Section: Resultsmentioning
confidence: 99%
“…They are baseline frequency, number of phrase commands, number of tone commands, phrase command duration, tone command duration, amplitude of phrase command and amplitude of tone command. The derived output parameters are mostly extracted for Thai tones (Chomphan, 2011).…”
Section: Fujisaki's Modelmentioning
confidence: 99%