Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
DOI: 10.1109/icassp.2005.1415298
|View full text |Cite
|
Sign up to set email alerts
|

A New ASR Evaluation Measure and Minimum Bayes-Risk Decoding for Open-domain Speech Understanding

Abstract: A new evaluation measure of speech recognition and a decoding strategy for keyword-based open-domain speech understanding are presented. Conventionally, WER (word error rate) has been widely used as an evaluation measure of speech recognition, which treats all words in a uniform manner. In this paper, we define a weighted keyword error rate (WKER) which gives a weight on errors from a viewpoint of information retrieval. We first demonstrate that this measure is more appropriate for predicting the performance o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0
2

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(10 citation statements)
references
References 8 publications
0
8
0
2
Order By: Relevance
“…Approaches beyond the "cascading framework" were also proposed. For example, just as ASR can be optimized for spoken content retrieval in Section III, ASR can also be optimized for summarization by considering the word significance in minimum Bayes-risk decoding [327]. In addition, the prosodic features can help not only retrieval as in Section IV-B, but summarization too [321], [328]- [330], since prosodic features help to identify the important part in speech.…”
Section: B Summarization Title Generation and Key Term Extraction Fmentioning
confidence: 99%
“…Approaches beyond the "cascading framework" were also proposed. For example, just as ASR can be optimized for spoken content retrieval in Section III, ASR can also be optimized for summarization by considering the word significance in minimum Bayes-risk decoding [327]. In addition, the prosodic features can help not only retrieval as in Section IV-B, but summarization too [321], [328]- [330], since prosodic features help to identify the important part in speech.…”
Section: B Summarization Title Generation and Key Term Extraction Fmentioning
confidence: 99%
“…However, such an error may not significantly change the meaning of the sentence and in fact may be sufficiently correct for clinical use. This can be partly mitigated by using relative word importance to re-weight the final metric accordingly 79,80 . However, this still measures word-level equivalence rather than sentence-level resemblance 81 .…”
Section: Measures Of Automatic Speech Recognition Performancementioning
confidence: 99%
“…For ASR of morphologically-rich languages, some more adequate metrics can be applied: Letter/Character Error Rate (LER or CER) (Kurimo et al, 2006a,b), Phone Error Rate (PER), Syllable Error Rate (SylER) (Huang et al, 2000) or Morpheme Error Rate (Ablimit et al, 2010). There exist also some other measures, such as Inflectional Word Error Rate (IWER) (Bhanuprasad and Svenson, 2008;Karpov et al, 2011), Speaker Attributed Word Error Rate (NIST, 2009), Weighted Word Error Rate (WWER) (Nanjo and Kawahara, 2005), etc.…”
Section: Evaluating Asr Performancementioning
confidence: 99%