International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.1990.115764
|View full text |Cite
|
Sign up to set email alerts
|

Neural networks for voiced/unvoiced speech classification

Abstract: This paper describes the results of designing, training, and testing a neural network for the voiced/unvoiced speech classification problem. A feed-forward multilayer backpropagation network was used with 6 input, 10 intemal, and 2 output nodes -for a binary decision. The six feames are common and easily computed. Training was done with 72 frames from two speakers; testing was done with 479 frames from four speakers; a total of 2 errors (0.4%) occurred. Thus a small neural network performs well on the V / W pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…In our view, these probabilistic graphical models provide a way to study feature detection that combines the benefits of prior knowledge and learning from examples. They enable researchers to bridge the divide between models based on expert engineering (Holmes, 1998) and those derived by automatic methods (Bendiksen & Steiglitz, 1990).…”
Section: Relation To Previous Workmentioning
confidence: 99%
“…In our view, these probabilistic graphical models provide a way to study feature detection that combines the benefits of prior knowledge and learning from examples. They enable researchers to bridge the divide between models based on expert engineering (Holmes, 1998) and those derived by automatic methods (Bendiksen & Steiglitz, 1990).…”
Section: Relation To Previous Workmentioning
confidence: 99%
“…Several methods of speech classification such as the autocorrelation function (ACF), short‐time energy of the signal (E), average magnitude difference function (AMDF), zero crossing rate (ZCR), cepstrum, discrete wavelet transform (DWT), and so on, that make use of acoustic features, have been reported in the literature [1,4–8]. A hybrid approach of speech classification such as the hidden Markov models, Gaussian mixture model or neural network (NN) model, that uses more than one feature, has been also reported [9‐21].…”
Section: Introductionmentioning
confidence: 99%
“…Bagavathi and Padma [13] presented a fuzzy c‐implies clustering method for classifying voiced and unvoiced activity using MFCC as features and achieved 91.5% classification accuracy. Bendiksen and Steiglitz [14] used an NN as a classifier for voiced/unvoiced speech classification. They extracted six features, namely, the rms energy of signal, the rms energy of the pre‐emphasized signal, the normalized autocorrelation coefficient of the signal at unit sample delay, the normalized autocorrelation coefficient of the pre‐emphasized signal at unit sample delay, the ratio of the signal energy above 4000 Hz to the signal energy below 2000 Hz, and the product of the signal energy above 4000 Hz to the signal energy below 2000 Hz.…”
Section: Introductionmentioning
confidence: 99%
“…Switched coders first try to classify the short-time speech frames into a small number of classes, and then use an appropriate coding scheme for each class. Here, the ANNs can be used to classify the frames, for example, as voiced, unvoiced, or silence (Bendiksen and Steiglitz 1990, Cohn 1991, Ghiselli-Crippa and El-Jaroudi 1991.…”
Section: F176 Speech Codingmentioning
confidence: 99%