2005
DOI: 10.1121/1.1943907
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating models of vowel perception

Abstract: There is a long-standing debate concerning the efficacy of formant-based versus whole spectrum models of vowel perception. Categorization data for a set of synthetic steady-state vowels were used to evaluate both types of models. The models tested included various combinations of formant frequencies and amplitudes, principal components derived from excitation patterns, and perceptually scaled LPC cepstral coefficients. The stimuli were 54 five-formant synthesized vowels that had a common F1 frequency and varie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
22
1

Year Published

2005
2005
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(24 citation statements)
references
References 34 publications
1
22
1
Order By: Relevance
“…These results would not be predicted based on current formant models of vowel perception, which focus primarily on information in the frequency region that was removed from the vowel signals used in the current study. As previously noted, research to date has focused on lower frequency spectral resonances when describing vowels and examining vowel identification performance (Peterson and Barney, 1952;Lehiste and Peterson, 1959;Miller, 1989;Hillenbrand et al, 1995;Molis, 2005). In contrast, the present results provide support for the study of perceptual information contained in the high-frequency region of vowels.…”
Section: B Preliminary Discussioncontrasting
confidence: 75%
See 1 more Smart Citation
“…These results would not be predicted based on current formant models of vowel perception, which focus primarily on information in the frequency region that was removed from the vowel signals used in the current study. As previously noted, research to date has focused on lower frequency spectral resonances when describing vowels and examining vowel identification performance (Peterson and Barney, 1952;Lehiste and Peterson, 1959;Miller, 1989;Hillenbrand et al, 1995;Molis, 2005). In contrast, the present results provide support for the study of perceptual information contained in the high-frequency region of vowels.…”
Section: B Preliminary Discussioncontrasting
confidence: 75%
“…Vowels are characteristically described according to low-frequency resonance characteristics (e.g., Peterson and Barney, 1952;Hillenbrand et al, 1995;Molis, 2005;Swanepoel et al, 2012). Classically, the study of vowel perception has focused on the lowest formant frequencies, typically F1, F2, and F3.…”
Section: Introductionmentioning
confidence: 99%
“…Since the classic study by Peterson and Barney ͑1952͒ on the distribution of the vowel formant frequencies on the F1-F2 plane, many studies were conducted in quiet on vowel perception ͑see reviews by Strange, 1989 of difference limens for formant discrimination ͑Flanagan, 1955; Hawks, 1994;Liu and Kewley-Port, 2001͒, vowel modeling ͑e.g., Syrdal and Gopal, 1986;Molis, 2005͒, and other aspects of vowel discrimination. Many factors were found to be important for vowel identification including formant frequencies ͑Peterson and Barney, 1952͒, vowel duration ͑e.g., Ainsworth, 1972͒, F0 ͑e.g., Lehiste andMeltzer, 1973͒, spectral contrast ͑Leek et al, 1987;Poroy, 2001͒, formant contour ͑Hillenbrand andGayvert, 1993;Hillenbrand and Nearey, 1999͒, spectral shape ͑e.g., Zahorian and Jaghargi, 1986;Ito et al, 2001͒, and spectral change ͑e.g., Strange et al, 1983͒. Of these factors, the formant frequencies and spectral shape, in particular, have been found to be major cues to vowel perception.…”
Section: Introductionmentioning
confidence: 99%
“…At the most severe noise level, À10 dB SNR, vowels containing detailed information on the spectral shape were recognized significantly better than vowels consisting only of formant information. Despite the fact that formant cues alone have been found to be adequate for vowel perception in quiet (Delattre et al, 1952;Klatt, 1982;Molis, 2005), a more complete description of the vowel spectrum in severe noise conditions resulted in better recognition. Zahorian and Jagharghi (1993) proposed that the whole-spectrum representation of a vowel provides some relevant spectral information that is not provided by formants.…”
Section: Discussionmentioning
confidence: 99%