2020
DOI: 10.1016/j.specom.2020.02.004
|View full text |Cite
|
Sign up to set email alerts
|

Analysis and classification of phonation types in speech and singing voice

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

4
4

Authors

Journals

citations
Cited by 20 publications
(18 citation statements)
references
References 58 publications
(112 reference statements)
0
18
0
Order By: Relevance
“…In the future, we plan to explore the Mel-SFF spectrogram derived features for dialect identification in noisy conditions [27], [28], [30] and for larger corpora. Further, we plan to investigate the complementary information between the SFF based spectrograms and zero-time windowing (ZTW) based spectrograms, which were shown to give better performance over STFT [42]- [44].…”
Section: Discussionmentioning
confidence: 99%
“…In the future, we plan to explore the Mel-SFF spectrogram derived features for dialect identification in noisy conditions [27], [28], [30] and for larger corpora. Further, we plan to investigate the complementary information between the SFF based spectrograms and zero-time windowing (ZTW) based spectrograms, which were shown to give better performance over STFT [42]- [44].…”
Section: Discussionmentioning
confidence: 99%
“…This impulse-like component is caused by the sudden deceleration of the air flow in the vicinity of the GCI due to adduction of the vocal folds. The characteristics of this impulse-like event in the excitation waveform, particularly its sharpness, are closely associated with several important speech attributes, such as voice quality [11], [34] and loudness [35], [36]. The excitation information also includes identifying the locations of GOIs, where secondary excitation of the vocal tract might take place.…”
Section: Demonstration Of the Production Of Voiced Speech In Two Phonation Types: Normal (Left) And Breathy (Right) The Upper Part Of Thementioning
confidence: 99%
“…Kadiri et al [34], Kadiri and Yegnanarayana [182], [183], and Kadiri and Alku [184] explored the features derived from the ZFF, ZTW, and SFF methods for discriminating phonation types. In these studies, cepstral coefficients were obtained from the spectra estimated by the three methods, and the cepstral coefficients were used in addition to excitation information scalar features [185], [186] (such as spectral statistics).…”
Section: A Study Of Phonation Typesmentioning
confidence: 99%
See 1 more Smart Citation
“…Glottal source features were used with the mel-frequency cepstral coefficients (MFCCs) in [18,23] for the automatic classification of voice qualities from speech signals. The experiments reported in [31,32] used features derived from the approximate glottal source signals such as the linear prediction residual and the zero frequency filtered signal (ZFF signal) to characterize voice qualities in speech and singing. Features including the energy of the ZFF signal, the slope of the ZFF signal at zero crossings, the energy of excitation, and the loudness measure were used for analysis and automatic classification of voice qualities in [31,32].…”
Section: Introductionmentioning
confidence: 99%