1973
DOI: 10.1109/tau.1973.1162466
|View full text |Cite
|
Sign up to set email alerts
|

The influence of glottal waveform on the naturalness of speech from a parallel formant synthesizer

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
30
0

Year Published

1974
1974
2017
2017

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 91 publications
(31 citation statements)
references
References 6 publications
1
30
0
Order By: Relevance
“…In this area, knowledge achieved via GIF has been used in the forms of artificial glottal source models and voice source control rules that have been utilized in several studies during the past three decades (e.g., Klatt 1987;Carlson et al 1989;Pinto et al 1989;de Veth et al 1990;Klatt & Klatt 1990;Carlson et al 1991;Karlsson 1991;Karlsson 1992;Fant 1993;Childers & Hu 1994;Childers 1995). In addition to these earlier studies, where the synthesis is typically rule-based and utilizes, for instance, the LF-model, GIF has also been used as a sound synthesis technique, in which the glottal source extracted from real speech is used to excite artificial vocal tract models (Holmes 1973;Matsui et al 1991;Alku et al 1999). These synthesis techniques enable generation of high-quality sounds, and the vocal tract characteristics can be controlled.…”
Section: Applications Of Gifmentioning
confidence: 99%
“…In this area, knowledge achieved via GIF has been used in the forms of artificial glottal source models and voice source control rules that have been utilized in several studies during the past three decades (e.g., Klatt 1987;Carlson et al 1989;Pinto et al 1989;de Veth et al 1990;Klatt & Klatt 1990;Carlson et al 1991;Karlsson 1991;Karlsson 1992;Fant 1993;Childers & Hu 1994;Childers 1995). In addition to these earlier studies, where the synthesis is typically rule-based and utilizes, for instance, the LF-model, GIF has also been used as a sound synthesis technique, in which the glottal source extracted from real speech is used to excite artificial vocal tract models (Holmes 1973;Matsui et al 1991;Alku et al 1999). These synthesis techniques enable generation of high-quality sounds, and the vocal tract characteristics can be controlled.…”
Section: Applications Of Gifmentioning
confidence: 99%
“…Other approaches to modeling the voice source include those motivated by physical modeling and include models such as Ishizaka and Flanagan [6] and Story and Titze [7]. The importance of accurately reproducing the voice source signal in speech synthesis is described in [8], where experimentation has shown that a parallel formant synthesizer can generate short speech segments indistinguishable from real speech provided it is driven by an inverse-filtered typical natural vowel from the same talker. A related approach is described in [9] where cepstrum coefficients are used to generate a single average voice source waveform from which any speech signal can be synthesized.…”
Section: Introductionmentioning
confidence: 99%
“…Early models of the source signal used a simple impulse train for modeling voiced signals. More recent studies model the shape of the glottal airflow or its derivative in the timedomain ͑Ananthapadmanabha, 1984; Fant et al, 1985;Hedelin, 1984;Holmes, 1973;Klatt and Klatt, 1990;Rosenberg, 1971͒. Frequency-domain representations for some of those models were presented in Fant ͑1995͒ and Doval and d'Alessandro ͑1999͒.…”
Section: Introductionmentioning
confidence: 99%