2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) 2018
DOI: 10.1109/atsip.2018.8364511
|View full text |Cite
|
Sign up to set email alerts
|

Towards a generic approach for automatic speech recognition error detection and classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…A clear motivating example is provided by the exponential growth of black-box speech recognition services, as Google voice Search and automatic captions in Youtube videos, where no information is available about the system used to produce the transcriptions. In this paper, we extends our previous works [5,6] on ASR error detection to a new and different scenario where information about the inner workings of the ASR system is not accessible. Unlike most approaches reported in the literature, we propose to handle the speech recognition errors independently from the decoder's internal information using a set of features derived exclusively from the recognizer output and hence should be trainable for any ASR system.…”
Section: A C C E P T E D Mmentioning
confidence: 88%
See 1 more Smart Citation
“…A clear motivating example is provided by the exponential growth of black-box speech recognition services, as Google voice Search and automatic captions in Youtube videos, where no information is available about the system used to produce the transcriptions. In this paper, we extends our previous works [5,6] on ASR error detection to a new and different scenario where information about the inner workings of the ASR system is not accessible. Unlike most approaches reported in the literature, we propose to handle the speech recognition errors independently from the decoder's internal information using a set of features derived exclusively from the recognizer output and hence should be trainable for any ASR system.…”
Section: A C C E P T E D Mmentioning
confidence: 88%
“…ASR errors often are not single events [6]. This is because a miss-recognized word generates often a sequence of ASR errors.…”
Section: Classifiermentioning
confidence: 99%
“…Thereby RNN are only able to represent distributions in which the label values are conditionally independent from each other given the input values. ASR errors often are not single events [7]. This is because a miss-recognized word generates often a sequence of ASR errors, as illustrated in Fig.…”
Section: Classifiersmentioning
confidence: 99%
“…To tackle these problems, we have been developing a new approach for ASR error detection and error type classification [3,7,8]. We have targeted a new and different scenario where information about the inner workings of the ASR system is not accessible.…”
Section: Introductionmentioning
confidence: 99%