Discriminative utterance verification for connected digits recognition

Rahim, Mazin G.; Lee, Chin‐Hui; Juang, Biing‐Hwang

doi:10.1109/89.568733

Cited by 109 publications

(51 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In [7], we proposed the MVER training method that directly minimizes the verification error rates 1 . Let N c and N i be the number of correct and incorrect samples in the training data set respectively.…”

Section: ) Minimum-verification-error (Mve) Trainingmentioning

confidence: 99%

See 1 more Smart Citation

Minimization of Utterance Verification Error Rate as a Constrained Optimization Problem

Siu

Mak

2006

IEEE Signal Process. Lett.

View full text Add to dashboard Cite

Abstract-Since utterance verification (UV) may be treated as a 2-class classification problem, it may be improved with discriminative training such as minimum verification error training or minimum verification error rate training. However, since in practice, one usually has to pick a specific false-acceptance or false-rejection rate for one's system, it is more desirable to optimize UV performance at a particular operating point. In this paper, we show that further improvement can be achieved by treating UV at a specific operating point as a constrained optimization problem.Index Terms-utterance verification, minimum verification error, minimum verification error rate

show abstract

Section: ) Minimum-verification-error (Mve) Trainingmentioning

confidence: 99%

“…In general, UV is treated as hypothesis testing [1], [2], [3] using the (log) likelihood ratio test: the ratio between the null hypothesis that the required word is spoken and the alternative hypothesis that it is not. A decision is made by comparing the ratio against a pre-set threshold.…”

Section: Introduction For Many Practical Speech Applications It Imentioning

confidence: 99%

Minimization of Utterance Verification Error Rate as a Constrained Optimization Problem

Siu

Mak

2006

IEEE Signal Process. Lett.

View full text Add to dashboard Cite

show abstract

“…Chigier, 1992;Bourlard et al, 1994). In Sukkar (1994), Rahim, Lee and Juang (1995b) and Rose et al (1995) discriminative training methods based on minimum classification error training (Juang & Katagiri, 1992;Chou, Juang, Lee & Soong, 1994a) have been proposed for utterance verification. For example, Rose et al (1995) described a minimum classification error (MCE) training approach for keyword verification which adjusts the parameters of the null hypothesis and the alternative hypothesis models of a tied-mixture density HMM-based system.…”

Section: Introductionmentioning

confidence: 99%

String-based minimum verification error (SB-MVE) training for speech recognition

Rahim

Lee²

1997

Computer Speech & Language

View full text Add to dashboard Cite

“…However, for simplicity, only the irrelevant document with the highest relevance score, or K=1, is selected for training in this study. There were previous detailed discussions of this issue [Rahim et al 1997;Juang et al 1997]. The classification error function in equation (10) can be transformed into a loss function ranging from 0 to 1 with the Sigmoid operator:…”

Section: Minimum Classification Error (Mce) Trainingmentioning

confidence: 99%

A discriminative HMM/N-gram-based retrieval approach for mandarin spoken documents

Chen

Wang

Lee

2004

ACM Transactions on Asian Language Information Processing

View full text Add to dashboard Cite

__________________________________________________________________________________________In recent years, statistical modeling approaches have steadily gained in popularity in the field of information retrieval. This article presents an HMM/N-gram-based retrieval approach for Mandarin spoken documents. The underlying characteristics and the various structures of this approach were extensively investigated and analyzed. The retrieval capabilities were verified by tests with word-and syllable-level indexing features and comparisons to the conventional vector-space model approach. To further improve the discrimination capabilities of the HMMs, both the expectation-maximization (EM) and minimum classification error (MCE) training algorithms were introduced in training. Fusion of information via indexing word-and syllable-level features was also investigated. The spoken document retrieval experiments were performed on the Topic Detection and Tracking Corpora (TDT-2 and TDT-3). Very encouraging retrieval performance was obtained. INTRODUCTIONOver the past three decades, statistical modeling approaches for speech and language processing have been studied extensively. Among the approaches, hidden Markov modeling (HMM) for speech recognition is undoubtedly the most prevalent and effective [Jelinek 1997]. In this approach, a set of statistical phoneme-or word-level HMMs was trained beforehand with a labeled speech corpus; the probability of the test speech utterance with respect to the HMMs was then evaluated on the HMM network to find the optimal phoneme or word sequence with the maximum likelihood. This statistical paradigm was first introduced for the information retrieval problem by BBN Technologies [Miller et al., 1999] and by Ponte and Croft [1998] and Song and Croft [1999], indicating very good potential, and was then extended in a number of

show abstract

Discriminative utterance verification for connected digits recognition

Cited by 109 publications

References 26 publications

Minimization of Utterance Verification Error Rate as a Constrained Optimization Problem

Minimization of Utterance Verification Error Rate as a Constrained Optimization Problem

String-based minimum verification error (SB-MVE) training for speech recognition

A discriminative HMM/N-gram-based retrieval approach for mandarin spoken documents

Contact Info

Product

Resources

About