2000
DOI: 10.1162/coli.2000.26.2.294b

Techniques in Speech Acoustics

Cited by 40 publications (12 citation statements)
References: 0 publications
“…Both pitch-asynchronous linear prediction (LP) procedures benefited from signal pre-emphasis, which is intended to cancel out the spectral tilt of the glottal source and increase the accuracy of the formant estimates. [12] The formant estimates obtained using the covariance algorithm only on the closed phase with or without pre-emphasis were considerably less accurate, despite the occasional advocacy of this latter technique as the most accurate. [13] Closed phase linear prediction analysis is extremely sensitive to properly locating the analysis window, and it appears that it is not possible to overcome these problems using a manual system such as PRAAT. Reassigned spectrograms were computed for a brief excerpt from the middle of each vowel encompassing five or six glottal cycles.…”
Section: Methods (mentioning)
confidence: 99%
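
The pre-emphasis step mentioned in the statement above is commonly implemented as a first-order difference filter, y[n] = x[n] - a·x[n-1]. The following is a minimal sketch of that general technique, not the citing authors' code. The function name `pre_emphasize` and the default coefficient 0.97 are assumptions chosen for illustration; the statement does not say which coefficient was used.

```python
def pre_emphasize(signal, coeff=0.97):
    """Apply first-order pre-emphasis: y[n] = x[n] - coeff * x[n-1].

    `coeff` is an assumed, commonly seen value; it is not taken from
    the cited study. The first sample is passed through unchanged.
    """
    if not signal:
        return []
    out = [float(signal[0])]
    # Each output sample subtracts a scaled copy of the previous input
    # sample, which attenuates low frequencies and so offsets the
    # downward spectral tilt of the glottal source.
    for prev, cur in zip(signal, signal[1:]):
        out.append(cur - coeff * prev)
    return out
```

A constant (zero-frequency) input is strongly attenuated after the first sample, which is the high-pass behavior that lets LP analysis spend its poles on formant peaks rather than on the overall tilt.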
“…The vowel target was usually marked where F1 reached a maximum value in open vowels and where F2 reached a maximum/minimum value in front/back vowels (Harrington & Cassidy, 1999). If the formants showed either little change or no evidence of reaching an asymptote within the vowel, an intensity peak was sometimes used to position the vowel target; if there was no evidence of an intensity peak, then the vowel target was positioned at the vowel's acoustic midpoint.…”
Section: Methods (mentioning)
confidence: 99%
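
The fallback order in the statement above (formant extremum, then intensity peak, then acoustic midpoint) can be sketched as a small decision rule. The statement describes a manual, judgment-based procedure ("usually", "sometimes"), so this is only an approximation under stated assumptions: the function name `vowel_target_index`, the `min_range_hz` threshold standing in for "little change", and the test that an extremum must fall strictly inside the vowel (standing in for "evidence of reaching an asymptote") are all hypothetical.

```python
def vowel_target_index(f1, f2, intensity, vowel_type, min_range_hz=50.0):
    """Return (frame_index, criterion) for a vowel target.

    f1, f2, intensity: equal-length per-frame lists over one vowel.
    vowel_type: "open" (F1 maximum), "front" (F2 maximum), "back" (F2 minimum).
    min_range_hz: assumed threshold below which a formant track counts
    as showing "little change"; not a value from the cited study.
    """
    track, pick = {
        "open": (f1, max),
        "front": (f2, max),
        "back": (f2, min),
    }[vowel_type]
    last = len(track) - 1

    # 1. Formant criterion: enough movement and an extremum that is
    #    not at either edge of the vowel.
    if max(track) - min(track) >= min_range_hz:
        idx = track.index(pick(track))
        if 0 < idx < last:
            return idx, "formant"

    # 2. Intensity criterion: a peak strictly inside the vowel.
    peak = intensity.index(max(intensity))
    if 0 < peak < last:
        return peak, "intensity"

    # 3. Fallback: the acoustic midpoint of the vowel.
    return len(track) // 2, "midpoint"
```

For an open vowel whose F1 rises and falls within the segment, the rule returns the frame of the F1 maximum; a vowel with flat formant tracks and flat intensity falls through to the midpoint.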
“…At the end of this step, the energy of each band-pass filter is calculated. Then, in the fifth-step, it is subjected to the logarithmic compression for mimicking the humans' audio perception [65]. The discrete cosine transform (DCT) of the logarithmic output is taken to de-correlate the coefficients, and hence, the static features of the input signal are obtained, at the final step.…”
Section: Mel-frequency Cepstral Coefficients (mentioning)
confidence: 99%
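
The last two steps described in the statement above, logarithmic compression of the band energies followed by a discrete cosine transform, can be sketched in a few lines. This is a generic illustration rather than the citing paper's implementation: the function name `log_dct`, the energy floor, and the unnormalized DCT-II scaling are assumptions, and real toolkits differ in how they scale the coefficients.

```python
import math


def log_dct(band_energies, num_coeffs, floor=1e-10):
    """Turn filterbank energies into cepstral coefficients.

    band_energies: non-negative energy of each band-pass (mel) filter.
    num_coeffs: how many DCT coefficients to keep.
    floor: assumed small constant that avoids log(0).
    Uses an unnormalized DCT-II; the scaling convention is an assumption.
    """
    # Logarithmic compression, roughly mimicking perceived loudness.
    log_e = [math.log(max(e, floor)) for e in band_energies]
    n = len(log_e)
    # DCT-II de-correlates the log energies of neighboring bands.
    return [
        sum(v * math.cos(math.pi * k * (m + 0.5) / n)
            for m, v in enumerate(log_e))
        for k in range(num_coeffs)
    ]
```

With identical energies in every band, only the zeroth coefficient is non-zero, which illustrates why the higher coefficients capture spectral shape rather than overall level.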