Interspeech 2018
DOI: 10.21437/interspeech.2018-1857

Exemplar-based Speech Waveform Generation

Abstract: This paper presents a simple but effective method for generating speech waveforms by selecting small units of stored speech to match a low-dimensional target representation. The method is designed as a drop-in replacement for the vocoder in a deep neural network-based text-to-speech system. Most previous work on hybrid unit selection waveform generation relies on phonetic annotation for determining unit boundaries, or for specifying target cost, or for candidate preselection. In contrast, our waveform generato…

Cited by 3 publications (9 citation statements)
References: 14 publications
“…We found that this simple search was sufficient as units are too short to deviate from the target sequence in the course of a single unit [8].…”
Section: Unit Search (mentioning)
confidence: 97%
“…From these indices a sequence of higher dimension acoustic features is created and used for waveform reconstruction. In this section we will summarise the waveform generation method proposed in [8] that forms the basis of the hybrid TTS framework proposed in this paper.…”
Section: Proposed Text-to-speech System With Examplar-based Speech Wa… (mentioning)
confidence: 99%