2010
DOI: 10.3758/brm.42.4.1030
|View full text |Cite
|
Sign up to set email alerts
|

Integrated software for analysis and synthesis of voice quality

Abstract: Voice quality is an important perceptual cue in many disciplines, but knowledge of its nature is limited by a poor understanding of the relevant psychoacoustics. This article (aimed at researchers studying voice, speech, and vocal behavior) describes the UCLA voice synthesizer, software for voice analysis and synthesis designed to test hypotheses about the relationship between acoustic parameters and voice quality perception. The synthesizer provides experimenters with a useful tool for creating and modeling v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
9

Relationship

4
5

Authors

Journals

citations
Cited by 32 publications
(21 citation statements)
references
References 40 publications
0
21
0
Order By: Relevance
“…The new time-domain source waveform was generated by inverse Fourier transform, after which the voice was resynthesized with the new source but with all other parameters held constant ͓Fig. 1͑b͔͒ ͑see Kreiman et al, 2010, for more details regarding this process͒. In this manner, we created two sets of stimuli for each target voice: one in which H1 amplitude increased in 15 steps of 0.5 dB, and one in which it decreased in 15 steps of 0.5 dB.…”
Section: A Stimulimentioning
confidence: 99%
“…The new time-domain source waveform was generated by inverse Fourier transform, after which the voice was resynthesized with the new source but with all other parameters held constant ͓Fig. 1͑b͔͒ ͑see Kreiman et al, 2010, for more details regarding this process͒. In this manner, we created two sets of stimuli for each target voice: one in which H1 amplitude increased in 15 steps of 0.5 dB, and one in which it decreased in 15 steps of 0.5 dB.…”
Section: A Stimulimentioning
confidence: 99%
“…For this reason, formant frequencies and bandwidths were derived through analysis-by-synthesis. Using the UCLA voice synthesizer (Kreiman et al, 2010a), the original voice samples were copy-synthesized using the following procedure. Because apparent harmonic amplitudes can vary depending on formant frequencies and bandwidths as well as source characteristics, vowel quality was first matched to the target, before the source was altered.…”
Section: Acoustic Measurementsmentioning
confidence: 99%
“…However, these concerns have become largely irrelevant. Computational constraints and artifacts are no longer common Kreiman et al, 2010), and much progress has been made in determining spectral parameters that vary across voices and to which listeners are sensitive (Kreiman et al, 2007a;Kreiman et al, 2014). Both approaches (modeling in the time domain vs in the spectral domain) share the goal of mapping between voice production and perception; but in the second case the functional significance of the magic moments or places is established a priori, ensuring that results will be perceptually meaningful, however complex the associations between physical and psychoacoustic events prove to be.…”
Section: General Discussion and Conclusionmentioning
confidence: 99%
“…Estimates of the sources of these voice samples were then derived via inverse filtering using the method described by Javkin et al (1987). Because inverse filtering is imprecise at best (e.g., Alku, 2011), these estimates were corrected using analysis-by-synthesis (AbS) to create synthetic voice samples that precisely matched the quality of the original natural voice samples (Kreiman et al, 2010), as follows. The spectrum of a single representative source pulse extracted via inverse filtering was calculated and used to model the harmonic part of the voice source.…”
Section: Modeling Glottal Pulses a Voice Samplesmentioning
confidence: 99%