2019
DOI: 10.1088/1742-6596/1237/2/022106
|View full text |Cite
|
Sign up to set email alerts
|

An Audio-Visual Whisper Database in Chinese

Abstract: Converting whisper to normal vocalized speech has been a hot research topic in speech signal processing area. A complete and large scale whisper database is a major basis for this task. In this paper, we propose a multimodal whisper database in Chinese mandarin. A total of 103 syllables and 100 sentences were carefully selected. 5 male and 5 female participants pronounced the syllables and sentences in whisper and normal styles respectively, result in 4096 parallel speech utterances and 263, 849 frames of voic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…Whispered speech, also known as unvoiced speech and typically produced with no vocal-cord vibration, is characterized by low-energy (Zhou et al, 2019 ). As opposed to “normal” speech, the speech produced through the use of voiced sounds with harmonic excitation, whispered speech is produced with broad-band noise (Zhou et al, 2019 ), being, for instance, the typical form of communication for individuals diagnosed with aphonia (Zhou et al, 2019 ). In our hypersonic world, whispered speech, which usually requires closeness between speaker and listener (Li, 2011 ), presents an inherent affective component.…”
Section: Related Workmentioning
confidence: 99%
“…Whispered speech, also known as unvoiced speech and typically produced with no vocal-cord vibration, is characterized by low-energy (Zhou et al, 2019 ). As opposed to “normal” speech, the speech produced through the use of voiced sounds with harmonic excitation, whispered speech is produced with broad-band noise (Zhou et al, 2019 ), being, for instance, the typical form of communication for individuals diagnosed with aphonia (Zhou et al, 2019 ). In our hypersonic world, whispered speech, which usually requires closeness between speaker and listener (Li, 2011 ), presents an inherent affective component.…”
Section: Related Workmentioning
confidence: 99%