2014 IEEE Spoken Language Technology Workshop (SLT) 2014
DOI: 10.1109/slt.2014.7078579
|View full text |Cite
|
Sign up to set email alerts
|

Effective data-driven feature learning for detecting name errors in automatic speech recognition

Abstract: This paper addresses the problem of detecting name errors in automatic speech recognition (ASR) output. The highly skewed label distributions (i.e. name errors are infrequent), sparse training data, and large number of potential lexical features pose significant challenges for training name error classification systems. Data-driven feature learning is needed for handling multiple languages but is sensitive to over fitting. We address the problem by designing aggregate features using a related (sentence-level n… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2015
2015
2015
2015

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…These features are combined in a maximum entropy (ME) classifier trained to predict name errors directly. This is the same as the baseline used in (He et al, 2014;Marin, 2015;, but with a different ASR system.…”
Section: System Overview and Tasksmentioning
confidence: 99%
See 3 more Smart Citations
“…These features are combined in a maximum entropy (ME) classifier trained to predict name errors directly. This is the same as the baseline used in (He et al, 2014;Marin, 2015;, but with a different ASR system.…”
Section: System Overview and Tasksmentioning
confidence: 99%
“…In this paper, we address these general problems -detecting rare events in an open-domain taskspecifically for name error detection. Prior work addressed the problem of skewed priors by artificially increasing the error rate by holding names out of the vocabulary (Chen et al, 2013) or by factoring the problem into sentence-level name detection and OOV word detection (He et al, 2014) (since OOV errors in general are more frequent than name errors). Sentence-level features are also shown to be more robust than local context in direct name error prediction (Marin, 2015).…”
Section: Ref: What Can We Get At Litanfeethmentioning
confidence: 99%
See 2 more Smart Citations