Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
DOI: 10.1109/icslp.1996.607170
|View full text |Cite
|
Sign up to set email alerts
|

On improving discrimination capability of an RNN based recognizer

Abstract: This paper presents a set of eective and ecient techniques to improve the discrimination capability of a recurrent neural network (RNN) based isolated word recognizer. The recognizer contains a set of individually trained RNN speech models (RSMs). Each of them represents a dierent word in the vocabulary. Speech recognition is performed by selecting the RSM that best matches the input utterance. For temporal supervised training of the RSMs, a new error function is introduced, in which the contributions of all p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…During the first stage, each RSM is trained independently with positive examples, i.e., utterances carrying the designated syllable. Temporal supervised training is carried out using the real-time-recurrent-learning (RTRL) algorithm proposed by Williams and Zipser [23] and the following error function is minimized [24]:…”
Section: Base Syllable Recognition Using Rnn'smentioning
confidence: 99%
See 1 more Smart Citation
“…During the first stage, each RSM is trained independently with positive examples, i.e., utterances carrying the designated syllable. Temporal supervised training is carried out using the real-time-recurrent-learning (RTRL) algorithm proposed by Williams and Zipser [23] and the following error function is minimized [24]:…”
Section: Base Syllable Recognition Using Rnn'smentioning
confidence: 99%
“…By using the proposed discriminative training algorithm, the accuracy was improved greatly to 87.1%. On the other hand, the RSM based recognition algorithm was applied to the recognition of 20 English words (ten isolated digits plus ten command words) and an accuracy of 91.9% was achieved in the multispeaker case [12], [24].…”
Section: Pass 4-nearest Model Selectionmentioning
confidence: 99%