Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
DOI: 10.1109/icslp.1996.607291
|View full text |Cite
|
Sign up to set email alerts
|

Modeling segmental duration in German text-to-speech synthesis

Abstract: This paper reports on the construction of a model for segmental duration in German. The model predicts the durations of speech sounds in various textual, prosodic, and segmental contexts. It has been implemented in the German version of the Bell Labs text-tospeech system [18,12]. The construction of the duration system was made efficient by the use of an interactive statistical analysis package that incorporates the approach outlined in [23]. The results are stored in tables in a format that can be directly in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
33
0

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 35 publications
(34 citation statements)
references
References 16 publications
1
33
0
Order By: Relevance
“…Database description: During the human speech production procedure the positional and contextual factors of a phone (place in syllable and word) play a very important role in the assessment of its duration (Mobius and Santen, 1996;Santen, 1992). The database was designed following this statement, so as for each phone to have multiple instances in various positions in different words (initial, medial, final) in the database.…”
Section: Meta-learning Algorithmsmentioning
confidence: 99%
See 4 more Smart Citations
“…Database description: During the human speech production procedure the positional and contextual factors of a phone (place in syllable and word) play a very important role in the assessment of its duration (Mobius and Santen, 1996;Santen, 1992). The database was designed following this statement, so as for each phone to have multiple instances in various positions in different words (initial, medial, final) in the database.…”
Section: Meta-learning Algorithmsmentioning
confidence: 99%
“…Various features can be extracted from text for the task of phone duration modeling (Mobius and Santen, 1996;Santen, 1992). The feature set implemented for this task includes phonological, morphological, linguistic and syntactic attributes.…”
Section: Feature Setmentioning
confidence: 99%
See 3 more Smart Citations