IEEE International Conference on Acoustics Speech and Signal Processing 2002
DOI: 10.1109/icassp.2002.1005748

The DYPSA algorithm for estimation of glottal closure instants in voiced speech

Abstract: We present the DYPSA algorithm for automatic and reliable estimation of glottal closure instants (GCIs) in voiced speech. Reliable GCI estimation is essential for closed-phase speech analysis, from which can be derived features of the vocal tract and, separately, the voice source. It has been shown that such features can be used with significant advantages in applications such as speaker recognition. DYPSA is automatic and operates using the speech signal alone without the need for an EGG or Laryngograph signa…

Cited by 33 publications (13 citation statements)
References 6 publications
“…Finally, the method is especially promising for application to the "classic" low-frequency problem of the inversion to the vocal tract shape from the speech signal, 45 although further consideration must be given to the deconvolution of the glottal wave form 46,47 and to calibration of the scale factor.…”
Section: Discussion (mentioning)
confidence: 98%
“…A similar idea for evaluating epoch extraction methods focused on determining GCIs was introduced in [33], being used later in [22] and [35]. However, other works, such as [30] and [43], propose to first align the marks before conducting the comparison. Nevertheless, the misalignments may lead to inaccurate evaluation results.…”
Section: Evaluation Measure (mentioning)
confidence: 99%
“…Pitch marking algorithms are focused on determining the temporal position of the frame periods of voiced speech [30], according to a predefined local criterion, e.g., 1) the maximum positive/negative peak [18], [28], [31], [32], 2) the minimum before the zero crossing [10], 3) the GCI estimated from the speech signal [26], [30], [33], its wavelet transform [34], [35], or the EGG signal [10], [11], [25], [28], among others.…”
Section: Towards Reliable Pitch Marking (mentioning)
confidence: 99%
“…A dynamic programming projected phase-slope algorithm (DYPSA) for automatic estimation of glottal closure instants in voiced speech was presented in Kounoudes et al (2002) and Naylor et al (2007). The candidates for GCI were obtained from the zero-crossings of the phase-slope function derived from the energy weighted group-delay, and were refined by employing a dynamic programming algorithm.…”
Section: DYPSA Algorithm for Epoch Extraction (mentioning)
confidence: 99%
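
The candidate-generation step described in the last statement can be illustrated with a minimal sketch. It relies on the fact that the energy-weighted group delay of a windowed frame equals the offset of the frame's centre of energy from the window centre, so the phase-slope function can be taken as the negative of that offset. The function name `phase_slope_candidates`, the rectangular window, and the synthetic impulse-train input are assumptions made for illustration only. The dynamic-programming refinement is omitted, so this is not the published DYPSA implementation.

```python
import numpy as np

def phase_slope_candidates(residual, win_len):
    """Return sample indices where the phase-slope function crosses zero upward.

    Hypothetical helper: `residual` stands in for an excitation-like signal
    and `win_len` is an odd sliding-window length chosen by the caller.
    """
    energy = np.asarray(residual, dtype=float) ** 2
    # Sample offsets measured from the window centre.
    offset = np.arange(win_len) - (win_len - 1) / 2.0
    # For each window start s: sum_k offset[k] * energy[s + k].
    num = np.correlate(energy, offset, mode="valid")
    # Window energy; the small constant avoids division by zero in silence.
    den = np.correlate(energy, np.ones(win_len), mode="valid") + 1e-12
    delay = num / den      # energy-weighted group delay, relative to centre
    slope = -delay         # phase-slope function (sign convention assumed)
    # Positive-going zero crossings of the phase-slope function.
    crossings = np.where((slope[:-1] < 0) & (slope[1:] >= 0))[0] + 1
    # Convert window-start indices to window-centre sample indices.
    return crossings + (win_len - 1) // 2

# Synthetic excitation: unit impulses every 100 samples.
signal = np.zeros(300)
signal[[50, 150, 250]] = 1.0
print(phase_slope_candidates(signal, 41))  # [ 50 150 250]
```

On this idealised input every impulse yields exactly one candidate. On real speech the candidate set would contain spurious and missing crossings, which is the situation the dynamic-programming stage mentioned in the statement is meant to resolve.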