Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181
DOI: 10.1109/icassp.1998.675422
|View full text |Cite
|
Sign up to set email alerts
|

Estimating the speaking rate by vowel detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
37
0
3

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 53 publications
(40 citation statements)
references
References 3 publications
0
37
0
3
Order By: Relevance
“…, C} are the numbers of vowels from a given alphabet. To solve the ASR task, query utterance X is automatically divided into K syllables (Janakiraman et al, 2010) and the vowel segment X(k) is extracted from the k-th syllable (Pfau and Ruske, 1998). We assume that syllables are extracted without mistakes, e.g., the voice commands are produced in isolated syllable mode (Merialdo, 1988).…”
Section: Voice Command Recognitionmentioning
confidence: 99%
“…, C} are the numbers of vowels from a given alphabet. To solve the ASR task, query utterance X is automatically divided into K syllables (Janakiraman et al, 2010) and the vowel segment X(k) is extracted from the k-th syllable (Pfau and Ruske, 1998). We assume that syllables are extracted without mistakes, e.g., the voice commands are produced in isolated syllable mode (Merialdo, 1988).…”
Section: Voice Command Recognitionmentioning
confidence: 99%
“…False alarms were reduced by rejecting peaks lower than a threshold and by rejecting all but one peak from sets of peaks that are too close in time. In Pfau and Ruske [1998] syllable nuclei were detected using a loudness function similar to Zwicker et al [1979], and false peaks were filtered in two ways. First, peaks that were not sufficiently close to a neighboring decline in the energy curve were removed.…”
Section: Syllable Detectionmentioning
confidence: 99%
“…The work in Dashiell et al [2008] combined nuclei estimations from a HMM based manner classifier, an energy curve method from Pfau and Ruske [1998], and the correlation based method from Wang and Narayanan Narayanan [2005, 2007]. It also estimated syllable boundaries by combining the manner classifier, the energy curve method from Pfau and Ruske [1998], and using minima of the spectral discontinuity features from Wu et al [1997], Shire [1997].…”
Section: Syllable Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…A recognizer that can distinguish between vowels and consonants, e.g., a broad phonetic class recognizer, or a vowel detection algorithm [10,11], would be sufficient for estimating speaking rate. The broad phonetic classes, e.g., nasals, stops, vowels, etc., possess more distinct spectral characteristics than the phones within the same broad phonetic classes.…”
Section: Introductionmentioning
confidence: 99%