Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-2863
|View full text |Cite
|
Sign up to set email alerts
|

Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech

Abstract: Voice source characteristics in different phonation types vary due to the tension of laryngeal muscles along with the respiratory effort. This study investigates the use of mel-frequency cepstral coefficients (MFCCs) derived from voice source waveforms for classification of phonation types in speech. The cepstral coefficients are computed using two source waveforms: (1) glottal flow waveforms estimated by the quasi-closed phase (QCP) glottal inverse filtering method and (2) approximate voice source waveforms o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

3
6

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 28 publications
0
7
0
Order By: Relevance
“…It can be seen that there are large variations especially in the harmonic structure of the glottal flow spectra between normal and pathological voice. In order to capture these variations and to represent them in a compact form, we propose to derive MFCCs from the spectra of the glottal source waveforms (as in our recent conference paper [99]). It should be noted that the proposed MFCC feature extraction is similar to the computation of conventional MFCC features, except that the proposed approach operates on the glottal source waveform instead of the acoustic voice signal.…”
Section: Extraction Of Mfccs From Glottal Source Waveformsmentioning
confidence: 99%
“…It can be seen that there are large variations especially in the harmonic structure of the glottal flow spectra between normal and pathological voice. In order to capture these variations and to represent them in a compact form, we propose to derive MFCCs from the spectra of the glottal source waveforms (as in our recent conference paper [99]). It should be noted that the proposed MFCC feature extraction is similar to the computation of conventional MFCC features, except that the proposed approach operates on the glottal source waveform instead of the acoustic voice signal.…”
Section: Extraction Of Mfccs From Glottal Source Waveformsmentioning
confidence: 99%
“…Here, 2M + 1 is the number of samples used to remove the trend. The ZFF signal is considered an approximate glottal source waveform [60,61], and the negative-to-positive zero-crossings (NPZCs) correspond to GCIs when considering the positive polarity of the signal [57,62]. The steps involved in the ZFF method are shown in Fig.…”
Section: Extraction Of Glottal Source Waveforms Using the Zff Methodsmentioning
confidence: 99%
“…Kadiri et al [34], Kadiri and Yegnanarayana [182], [183], and Kadiri and Alku [184] explored the features derived from the ZFF, ZTW, and SFF methods for discriminating phonation types. In these studies, cepstral coefficients were obtained from the spectra estimated by the three methods, and the cepstral coefficients were used in addition to excitation information scalar features [185], [186] (such as spectral statistics).…”
Section: A Study Of Phonation Typesmentioning
confidence: 99%
“…In these studies, cepstral coefficients were obtained from the spectra estimated by the three methods, and the cepstral coefficients were used in addition to excitation information scalar features [185], [186] (such as spectral statistics). Recently, in [184], the MFCCs computed from the glottal source waveforms estimated by the QCP method and the ZFF method were shown to be effective for the classification of different phonation types from speech signals.…”
Section: A Study Of Phonation Typesmentioning
confidence: 99%