Human 4.0 - From Biology to Cybernetic 2021
DOI: 10.5772/intechopen.89849
|View full text |Cite
|
Sign up to set email alerts
|

The Theory behind Controllable Expressive Speech Synthesis: A Cross-Disciplinary Approach

Abstract: As part of the Human-Computer Interaction field, Expressive speech synthesis is a very rich domain as it requires knowledge in areas such as machine learning, signal processing, sociology, psychology.In this Chapter, we will focus mostly on the technical side. From the recording of expressive speech to its modeling, the reader will have an overview of the main paradigms used in this field, through some of the most prominent systems and methods.We explain how speech can be represented and encoded with audio fea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…The voice quality and the number of control parameters depend on the synthesis technique used [1,5]. These parameters allow variations to be created in the voice.…”
Section: Related Work and Challengesmentioning
confidence: 99%
“…The voice quality and the number of control parameters depend on the synthesis technique used [1,5]. These parameters allow variations to be created in the voice.…”
Section: Related Work and Challengesmentioning
confidence: 99%
“…Speech synthesis methods can be grouped in three main categories: synthesis by concatenation, parametric synthesis and statistical parametric synthesis [4]. Among the few studies on laughter synthesis, the first attempts included techniques like synthesis by diphone concatenation [5], parametric synthesis and by using a mass-spring approach [6].…”
Section: Related Workmentioning
confidence: 99%
“…Generating natural speech is a fundamental building block in improving human-computer interaction (Tits et al, 2019). Modeling and converting emotion in speech is arguably one of the main challenges in developing more natural and expressive speech synthesis models.…”
Section: Introductionmentioning
confidence: 99%