[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing 1992
DOI: 10.1109/icassp.1992.225950

A scheme for pitch extraction of speech using autocorrelation function with frame length proportional to the time lag

Cited by 15 publications (5 citation statements)
References 6 publications
“…The F 0 contours were extracted by the modified autocorrelation analysis of the LPC residual [Hirose et al, 1992]. Syllable boundaries and rhyme boundaries were marked manually by visual inspection of the waveform and the spectrogram.…”
Section: Speech Data (mentioning)
confidence: 99%
“…The relationship between frame lengths and the f0 of a speaker is complicated due to the inherent variation of frequency profiles from one speaker to the next (Hirose, Fujisaki, & Seto, 1992), but managing speaker-specific analysis settings individually requires extensive expertise and time and is impractical for large volumes of data. If the pitch floor is set too low, very fast f0 changes will be missed, and if it is set too high, low f0 values will be neglected.…”
Section: Discussion (mentioning)
confidence: 99%
“…B: Reading of a chapter of a book by another male speaker (consisting of 85 sentences that are longer on the average than those of Speech Material A) recorded from a radio program "From My Bookshelf" by the Japan Broadcasting Corporation (NHK). These speech signals were digitized at 10 kHz with 16-bit precision, and the fundamental frequency was extracted by a modified autocorrelation analysis of the LPC residual signal [8]. the Japanese utterance: "Ikutsukano otodake sokokara shakuyooshite, raibuno fun'ikio sokonawazuni henshuusuru kotoga dekiru."…”
Section: Speech Materials (mentioning)
confidence: 99%