Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-2042
|View full text |Cite
|
Sign up to set email alerts
|

Effects of Dimensional Input on Paralinguistic Information Perceived from Synthesized Dialogue Speech with Neural Network

Abstract: A novel method of controlling paralinguistic information in neural network-based dialogue speech synthesis is proposed. Controlling paralinguistic information was achieved by feeding emotion dimensions in continuous values into the input layer of the neural networks. Compared to the method using the multiple regression HMM, the naturalness of synthesized speech was improved. The controllability of paralinguistic information was evaluated by examining the shift of the distribution of synthesized parameters. A s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 9 publications
0
1
0
Order By: Relevance
“…Recently, Ben-David and Shechtman proposed a prosody-controllable speech synthesis that can express expressive styles given conversational context [42]. Besides these, no work on spontaneous conversational speech synthesis was found in Interspeech conferences or SSW (Speech Synthesis Workshops) presentations, except for the authors' works [43], [44]. One reason for this might be that it has been unclear what a spontaneously speaking machine could be useful for.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, Ben-David and Shechtman proposed a prosody-controllable speech synthesis that can express expressive styles given conversational context [42]. Besides these, no work on spontaneous conversational speech synthesis was found in Interspeech conferences or SSW (Speech Synthesis Workshops) presentations, except for the authors' works [43], [44]. One reason for this might be that it has been unclear what a spontaneously speaking machine could be useful for.…”
Section: Introductionmentioning
confidence: 99%