Dimensional Affective Speech Synthesis Based on Voice Conversion
Xin Zhang, Yaobin Wan, Wei Wang
Abstract: Affective speech synthesis can promote more natural human–computer interaction. Previous studies in the field of speech synthesis have used feature conversion to achieve natural affective speech. However, they focused on the adjustment of prosodic features and typically used a discrete emotion model; few studies on affective speech synthesis reflect the dimensional emotions expressed in daily life. To address these issues, we introduce a 2-dimensional valence–arousal emotion model into a speech synthesis syste…