2010
DOI: 10.1016/j.csl.2009.10.001
|View full text |Cite
|
Sign up to set email alerts
|

Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition

Abstract: International audienceMany automatic speech recognition (ASR) systems rely on the sole pronunciation dictionaries and language models to take into account information about language. Implicitly, morphology and syntax are to a certain extent embedded in the language models but the richness of such linguistic knowledge is not exploited. This paper studies the use of morpho-syntactic (MS) information in a post-processing stage of an ASR system, by reordering N-best lists. Each sentence hypothesis is first part-of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 26 publications
(13 citation statements)
references
References 23 publications
0
13
0
Order By: Relevance
“…, p m ) of m phonemes p i is assigned a score s(h) mixing the CRF and phonological model (PM) probabilities. This mixture is computed by a log-linear interpolation-which has been successfully used for N -best list reranking in various domains [18,19]-, and is formulated as follows:…”
Section: Phonological Rescoring and Rerankingmentioning
confidence: 99%
“…, p m ) of m phonemes p i is assigned a score s(h) mixing the CRF and phonological model (PM) probabilities. This mixture is computed by a log-linear interpolation-which has been successfully used for N -best list reranking in various domains [18,19]-, and is formulated as follows:…”
Section: Phonological Rescoring and Rerankingmentioning
confidence: 99%
“…Proper names are also a topic of interest in pronunciation (Reveil et al 2012;Schlippe et al 2014), which is also important for speech recognition. Huet et al (2010) proposed a post-processing method of speech recognizer hypotheses, which includes morphosyntactic description tagging. In the post processing step consecutive numerals and consecutive proper names are grouped into single cardinal and proper names tags.…”
Section: Previous Workmentioning
confidence: 99%
“…In the final steps, a 4-gram language model over a vocabulary of 65,000 words is used with context-dependent phone models to generate a list of 1,000 sentence transcription hypotheses. Morphosyntactic tagging, using a tagger specifically designed for ASR transcripts, is used in a post-processing stage to generate a final transcription by consensus from a confusion network, combining the acoustic, language model and morphosyntactic scores [25]. Confusion network posterior probabilities are used directly as confidence measures.…”
Section: Asr System and Confidence Measuresmentioning
confidence: 99%