Interspeech 2016 2016
DOI: 10.21437/interspeech.2016-979
|View full text |Cite
|
Sign up to set email alerts
|

Direct Expressive Voice Training Based on Semantic Selection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2018
2018
2018
2018

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 10 publications
0
2
0
Order By: Relevance
“…Semantic vector representations of text have been used to perform a look-up in the training corpus for expressive speech data according to the textual input, such that, relying on semantic information, data clusters were used to train expressive voices via speaker adaptation, as for example in [1]. A logical evolution of this study is to use embeddings which are more dedicated to the expressiveness in text.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Semantic vector representations of text have been used to perform a look-up in the training corpus for expressive speech data according to the textual input, such that, relying on semantic information, data clusters were used to train expressive voices via speaker adaptation, as for example in [1]. A logical evolution of this study is to use embeddings which are more dedicated to the expressiveness in text.…”
Section: Introductionmentioning
confidence: 99%
“…A further improvement in comparison to work presented in [1] is the migration from HMM-based synthesis to DNN-based synthesis. A main drawback of the HMM-based synthesis is that the training data is clustered.…”
Section: Introductionmentioning
confidence: 99%