2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
DOI: 10.1109/icassp.2001.941043
|View full text |Cite
|
Sign up to set email alerts
|

Speech synthesis using stochastic Markov graphs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 8 publications
0
5
0
Order By: Relevance
“…This enables an inverse function of the recognizer system, offering the potential of applying analysis-by-synthesis approach to improve performance of speech recognition systems. Synthesis experiments, as well as recognition experiments, confirmed the suitability of this model for both tasks [16]. The system is used to create artificial adaptation data, and later itself is adapted on the transformed data (coding, noise etc.)…”
Section: Introductionmentioning
confidence: 58%
See 3 more Smart Citations
“…This enables an inverse function of the recognizer system, offering the potential of applying analysis-by-synthesis approach to improve performance of speech recognition systems. Synthesis experiments, as well as recognition experiments, confirmed the suitability of this model for both tasks [16]. The system is used to create artificial adaptation data, and later itself is adapted on the transformed data (coding, noise etc.)…”
Section: Introductionmentioning
confidence: 58%
“…Baseline performance evaluation of the UASR system [16] was performed initially, in order to assess the effects of speech and audio compression on recognition performance. For training and evaluation, the natural speech data subset VM2 of Verbmobil German Database was used [19].…”
Section: Evaluation Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Related to this topic, Eichner et al applied stochastic Markov graphs, which have enhanced capabilities for modeling trajectories, to statistical parametric synthesis (Eichner et al, 2000). Although this offers a flexible topology, it requires a search process for the state sequence at the synthesis stage (Eichner et al, 2001) because we need to determine a single-state sequence to generate speech parameters efficiently using the speech parameter generation algorithm (the Case 1 algorithm in (Tokuda et al, 2000)). Although we can skip this process by marginalizing all possible state sequences using an EM-type parameter generation algorithm (the Case 3 algorithm in (Tokuda et al, 2000)), this further increases computational complexity.…”
Section: Model Topologymentioning
confidence: 99%