2009 IEEE International Conference on Acoustics, Speech and Signal Processing 2009
DOI: 10.1109/icassp.2009.4960569
|View full text |Cite
|
Sign up to set email alerts
|

Control of prosodic focus in corpus-based generation of fundamental frequency contours of Japanese based on the generation process model

Abstract: A method was developed for generating sentence F0 contours, when a focus is placed in one of bunsetsu of an utterance. The method is to predict differences in F0 model commands between with and without focus utterances, and applies them to the F0 model commands predicted beforehand by the baseline method. The validity of the method was proved by the experiment on F0 contour generation and speech synthesis.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2010
2010
2016
2016

Publication Types

Select...
5
2
1

Relationship

4
4

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 3 publications
0
7
0
Order By: Relevance
“…For instance, we have developed a corpus-based method to predict differences in F 0 model commands between two versions of utterances of the same linguistic content [17,18]. Applying the predicted differences to the baseline version of speech, the new version of speech can be realized.…”
Section: Discussionmentioning
confidence: 99%
“…For instance, we have developed a corpus-based method to predict differences in F 0 model commands between two versions of utterances of the same linguistic content [17,18]. Applying the predicted differences to the baseline version of speech, the new version of speech can be realized.…”
Section: Discussionmentioning
confidence: 99%
“…In the proposed method, generated F0 contours are represented as the sum of three contours, two of which are generated from HMM's trained using the phrase and accent components of the F0 model, and one from HMM's trained using F0 residuals. The extraction of F0 model commands is considered to be easy for the former two contours, leading to a flexible and systematic control of prosody [22,23].…”
Section: Hmm-based Speech Synthesismentioning
confidence: 99%
“…The Fujisaki model (Fujisaki, 1983) is another wellknown prosodic model in ESS (Chen et al, 2004;Kiriyama et al, 2002). Ochi et al (2009) used this model to control focus by modifying the Fujisaki model parameters. Prosodic variation caused by focus was investigated by considering the difference between utterances with and without focus.…”
Section: Introductionmentioning
confidence: 99%