2015
DOI: 10.15439/2015f386
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition

Abstract: Abstract-We investigate whether language models used in automatic speech recognition (ASR) should be trained on speech transcripts rather than on written texts. By calculating log-likelihood statistic for part-of-speech (POS) n-grams, we show that there are significant differences between written texts and speech transcripts. We also test the performance of language models trained on speech transcripts and written texts in ASR and show that using the former results in greater word error reduction rates (WERR),… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 8 publications
0
2
0
Order By: Relevance
“…This article contains a general statistics of Polish language that can be useful for a variety of language and speech processing applications, including automatic speech recognition with language models [55].…”
Section: Example Of Practical Application Of the Obtained Results Formentioning
confidence: 99%
“…This article contains a general statistics of Polish language that can be useful for a variety of language and speech processing applications, including automatic speech recognition with language models [55].…”
Section: Example Of Practical Application Of the Obtained Results Formentioning
confidence: 99%
“…they describe relations between words (or other tokens), thus enabling to choose most probable sequences. This proves to be especially useful in speech recognition, where acoustical models usually produce a number of hypotheses, and reranking them according to a language model can substantially improve recognition rates [10] To compare the performance of our Fongbe recognition system, we built two language models (LM) using the same text corpus. The first language model (LM1) is built with the original texts after normalization and contain different tonal vowels.…”
Section: B Text Corpusmentioning
confidence: 99%