2008
DOI: 10.1007/978-3-540-87391-4_63
|View full text |Cite
|
Sign up to set email alerts
|

Prosodic Phrases and Semantic Accents in Speech Corpus for Czech TTS Synthesis

Abstract: Abstract. We describe a statistical method for assignment of prosodic phrases and semantic accents in read speech data. The method is based on statistical evaluation of listening test data by a maximum-likelihood approach with parameters estimated by an EM algorithm. We also present linguistically relevant quantitative results about the prosodic phrase and semantic accent distribution in 250 Czech sentences.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 4 publications
0
3
0
Order By: Relevance
“…Emotional TTS synthesis will be evaluated separately from the whole dialogue system by listening tests. These listening tests will follow the scheme we have developed for prosodic phrase and semantic accent annotation, including statistical modeling of the results using the maximum likelihood approach [8]. The overall naturalness of audiovisual experience resulting from the TTS and avatar activity can be measured only indirectly by intersubjective assessments of testing users from among seniors.…”
Section: Discussionmentioning
confidence: 99%
“…Emotional TTS synthesis will be evaluated separately from the whole dialogue system by listening tests. These listening tests will follow the scheme we have developed for prosodic phrase and semantic accent annotation, including statistical modeling of the results using the maximum likelihood approach [8]. The overall naturalness of audiovisual experience resulting from the TTS and avatar activity can be measured only indirectly by intersubjective assessments of testing users from among seniors.…”
Section: Discussionmentioning
confidence: 99%
“…It is an asymptotically consistent, asymptotically normal and asymptotically efficient estimate. This approach was also successfully used in other works regarding speech synthesis research, see [46].…”
Section: Objective Annotationmentioning
confidence: 97%
“…It is an asymptotically consistent, asymptotically normal and asymptotically efficient estimate. We have also successfully used this approach in recent works regarding speech synthesis research, see [8].…”
Section: Objective Annotationmentioning
confidence: 99%