11th ISCA Speech Synthesis Workshop (SSW 11) 2021
DOI: 10.21437/ssw.2021-1
|View full text |Cite
|
Sign up to set email alerts
|

Identifying the vocal cues of likeability, friendliness and skilfulness in synthetic speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(8 citation statements)
references
References 0 publications
0
8
0
Order By: Relevance
“…The evaluation of these systems have consistently suggested improvements in the existing synthesis procedure [14,15,16,17]. In our previous work, we have studied the commercial TTS systems, Google 1 and Amazon voices 2 [18,19]. Our study shows various speaker attributes contributing to the perception of the universal dimensions (warmth and competence) in synthetic speech [18].…”
Section: Introductionmentioning
confidence: 86%
See 4 more Smart Citations
“…The evaluation of these systems have consistently suggested improvements in the existing synthesis procedure [14,15,16,17]. In our previous work, we have studied the commercial TTS systems, Google 1 and Amazon voices 2 [18,19]. Our study shows various speaker attributes contributing to the perception of the universal dimensions (warmth and competence) in synthetic speech [18].…”
Section: Introductionmentioning
confidence: 86%
“…DSSC represents the desired social speaker characteristics from synthetic speech [18]. DAV represents derived acoustic features [19]. To generate highly warm female speech, we studied the acoustic features that are commonly found in the speaker attributes, likeability and friendliness: F1 mean, F2 mean, spectral flux.…”
Section: Overviewmentioning
confidence: 99%
See 3 more Smart Citations