2008
DOI: 10.1109/icassp.2008.4518552
|View full text |Cite
|
Sign up to set email alerts
|

Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(11 citation statements)
references
References 5 publications
0
11
0
Order By: Relevance
“…[17,18]) and by adding efficient features (e.g. [6,7,8,9,18]). Second, we will revise our discriminative ETC methods so that they are capable of detecting multiple (consecutive) deletion errors and estimating their number (i.e.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…[17,18]) and by adding efficient features (e.g. [6,7,8,9,18]). Second, we will revise our discriminative ETC methods so that they are capable of detecting multiple (consecutive) deletion errors and estimating their number (i.e.…”
Section: Discussionmentioning
confidence: 99%
“…They consist of the features (IDs 1 to 10) used in the recent discriminative model based confidence estimation and OOV detection methods (e.g. [6,7,8,9]) and the WANbased CSID probabilities with some additional features (IDs 11 to 17).…”
Section: Discriminative Error Type Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…A related area of work is ASR confidence estimation, which seeks to label erroneous words in an ASR hypothesis. [14] uses the comparison of phones in a strong ASR system and a weak ASR system without a language model as features for error detection. Regions where the difference is large indicate a higher likelihood of errors.…”
Section: Related Workmentioning
confidence: 99%
“…Numerous works have addressed this aspect of LVCSR in various application scenarios. One scenario is OOV detection, i.e., finding the parts of the utterance that correspond to words unknown to the recognizer [1], [2], [3] where popular approaches include using generalized word or subword filler models, aligning word and subword representations and using confidence measures obtained from various sources. The other scenario is OOV modeling, i.e., the derivation of a lexical representation for the speech component of an unknown word.…”
Section: Introductionmentioning
confidence: 99%