Towards a generic approach for automatic speech recognition error detection and classification

Errattahi, Rahhal; Hain, Thomas; Ouahmane, Hassan

doi:10.1109/atsip.2018.8364511

Cited by 6 publications

(4 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A clear motivating example is provided by the exponential growth of black-box speech recognition services, as Google voice Search and automatic captions in Youtube videos, where no information is available about the system used to produce the transcriptions. In this paper, we extends our previous works [5,6] on ASR error detection to a new and different scenario where information about the inner workings of the ASR system is not accessible. Unlike most approaches reported in the literature, we propose to handle the speech recognition errors independently from the decoder's internal information using a set of features derived exclusively from the recognizer output and hence should be trainable for any ASR system.…”

Section: A C C E P T E D Mmentioning

confidence: 88%

See 1 more Smart Citation

System-independent ASR error detection and classification using Recurrent Neural Network

Errattahi

Hain

Ouahmane

2019

Computer Speech & Language

Self Cite

View full text Add to dashboard Cite

show abstract

Section: A C C E P T E D Mmentioning

confidence: 88%

“…ASR errors often are not single events [6]. This is because a miss-recognized word generates often a sequence of ASR errors.…”

Section: Classifiermentioning

confidence: 99%

System-independent ASR error detection and classification using Recurrent Neural Network

Errattahi

Hain

Ouahmane

2019

Computer Speech & Language

Self Cite

View full text Add to dashboard Cite

show abstract

“…Thereby RNN are only able to represent distributions in which the label values are conditionally independent from each other given the input values. ASR errors often are not single events [7]. This is because a miss-recognized word generates often a sequence of ASR errors, as illustrated in Fig.…”

Section: Classifiersmentioning

confidence: 99%

“…To tackle these problems, we have been developing a new approach for ASR error detection and error type classification [3,7,8]. We have targeted a new and different scenario where information about the inner workings of the ASR system is not accessible.…”

Section: Introductionmentioning

confidence: 99%

Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection

et al. 2021

Self Cite

View full text Add to dashboard Cite

Speech based human-machine interaction and natural language understanding applications have seen a rapid development and wide adoption over the last few decades. This has led to a proliferation of studies that investigate Error detection and classification in Automatic Speech Recognition (ASR) systems. However, different data sets and evaluation protocols are used, making direct comparisons of the proposed approaches (e.g. features and models) difficult. In this paper we perform an extensive evaluation of the effectiveness and efficiency of state-of-the-art approaches in a unified framework for both errors detection and errors type classification. We make three primary contributions throughout this paper: (1) we have compared our Variant Recurrent Neural Network (V-RNN) model with three other state-of-the-art neural based models, and have shown that the V-RNN model is the most effective classifier for ASR error detection in term of accuracy and speed, (2) we have compared four features’ settings, corresponding to different categories of predictor features and have shown that the generic features are particularly suitable for real-time ASR error detection applications, and (3) we have looked at the post generalization ability of our error detection framework and performed a detailed post detection analysis in order to perceive the recognition errors that are difficult to detect.

show abstract

Incorporating label dependency for ASR error detection via RNN

Errattahi

Salmam

Ouahmane

2019

Procedia Computer Science

View full text Add to dashboard Cite

Towards a generic approach for automatic speech recognition error detection and classification

Cited by 6 publications

References 16 publications

System-independent ASR error detection and classification using Recurrent Neural Network

System-independent ASR error detection and classification using Recurrent Neural Network

Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection

Incorporating label dependency for ASR error detection via RNN

Contact Info

Product

Resources

About