Proceedings of the Fourth Workshop on Analytics for Noisy Unstructured Text Data 2010
DOI: 10.1145/1871840.1871849
|View full text |Cite
|
Sign up to set email alerts
|

Reshaping automatic speech transcripts for robust high-level spoken document analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…We will study the bias of the posterior probabilities, as it was shown in [21] to impact the NCE and thus the quality of confidence measures based on these probabilities. We will investigate using other features for the characterization step and/or the detection step, especially probabilities given by n-class LMs using POS classes (as the n-class part of our word-to-class backoff LM) that were proven to be efficient, in recent works in speech recognition [13,36]. It would also be of interest to investigate using other measures to optimize the error detection threshold, like the F-measure, and to study how it impacts the correction step and thus the overall error handling process.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…We will study the bias of the posterior probabilities, as it was shown in [21] to impact the NCE and thus the quality of confidence measures based on these probabilities. We will investigate using other features for the characterization step and/or the detection step, especially probabilities given by n-class LMs using POS classes (as the n-class part of our word-to-class backoff LM) that were proven to be efficient, in recent works in speech recognition [13,36]. It would also be of interest to investigate using other measures to optimize the error detection threshold, like the F-measure, and to study how it impacts the correction step and thus the overall error handling process.…”
Section: Resultsmentioning
confidence: 99%
“…They are also used in machine translation tasks to assess the quality of the translation, as in [40], where various measures are compared. Among all these confidence measures, word posterior probabilities have been shown to be among the best [5,23,41], and they can be combined with other information sources in a neural network [23], in a SVM [21], or using conditional random fields [13] to achieve even better results. These confidence measures can also be used to detect recognition errors by rejecting words, the value of which is below a considered threshold.…”
Section: Speech Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…Accessing large collections of audio files is difficult and intrinsically time-consuming. Browsing audio recordings is a difficult task for humans, and reviewing such documents quickly is technically difficult [Fayolle et al, 2010]. Automatic speech recognition (ASR) produces searchable text but is also expensive in terms of both time and computational resources.…”
Section: Motivationmentioning
confidence: 99%