Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181
DOI: 10.1109/icassp.1998.674357
|View full text |Cite
|
Sign up to set email alerts
|

Use of periodicity and jitter as speech recognition features

Abstract: addition of new feature components to the frame vectors.We investigate a class of features related to voicing parameters that indicate whether the vocal chords are vibrating. Features describing voicing characteristics of speech signals are integrated with an existing 38-dimensional feature vector consisting of first and second order time derivatives of the frame energy and of the cepstral coefficients with their first and second derivatives. HMM-based connected digit recognition experiments comparing the trad… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0
1

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(14 citation statements)
references
References 8 publications
0
13
0
1
Order By: Relevance
“…Another common approach to quantify periodicity relies on the auto-correlation (AC) function of the speech signal [9,11] by measuring the relative height of the maximum of this function in the plausible pitch range. The Average Magnitude Difference Function (AMDF) can be formulated as a function of the AC function.…”
Section: Robust Excitation-based Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…Another common approach to quantify periodicity relies on the auto-correlation (AC) function of the speech signal [9,11] by measuring the relative height of the maximum of this function in the plausible pitch range. The Average Magnitude Difference Function (AMDF) can be formulated as a function of the AC function.…”
Section: Robust Excitation-based Featuresmentioning
confidence: 99%
“…The first attempt was made by Thomson [9,10] who proposed the use of two voicing measures: an auto-correlation based measure of periodicity and the jitter to characterize the inter-frame pitch variation. When combined to cepstral features, a relative reduction of 40% of the string error rate was obtained on a connected digit recogntion task.…”
Section: Introductionmentioning
confidence: 99%
“…There have been also attempts at using articulatory information in the acoustic front-end, e.g. autocorrelation based voicedness feature [1]. In this paper we investigate the combination of different auditory based and articulatory based acoustic features.…”
Section: Introductionmentioning
confidence: 99%
“…In [1], liftered cepstral coefficients derived from all-poles magnitude spectrum has been directly concatenated with a voicedness feature. Using the concatenated features, a large relative improvement in word error rate (WER) has been achieved by applying discriminative training.…”
Section: Introductionmentioning
confidence: 99%
“…In [6], a sub-optimal solution is proposed for selecting features from two different sets. Other approaches integrate some specific parameters into a single stream of features [9]. Without attempting to find an optimal set of acoustic measurements, many recent automatic speech recognition (ASR) systems combine streams of different acoustic measurements ( [4], [10]).…”
Section: Introductionmentioning
confidence: 99%