2012
DOI: 10.1109/tasl.2012.2201477
|View full text |Cite
|
Sign up to set email alerts
|

Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition

Abstract: This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
22
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 32 publications
(24 citation statements)
references
References 39 publications
2
22
0
Order By: Relevance
“…This contrast is similar to some of the methods explored in [2] for open vocabulary recognition, and we confirm their finding that unsupervised morphology learning gives similar results compared to the rule-based system. Further, unlike this and other prior work which uses morphology in Turkish broadcast news transcription [18,4,19,2], our study involves keyword spotting in conversational Turkish with minimal training resources (10 vs. roughly 200 hours).…”
Section: Introductionsupporting
confidence: 87%
See 4 more Smart Citations
“…This contrast is similar to some of the methods explored in [2] for open vocabulary recognition, and we confirm their finding that unsupervised morphology learning gives similar results compared to the rule-based system. Further, unlike this and other prior work which uses morphology in Turkish broadcast news transcription [18,4,19,2], our study involves keyword spotting in conversational Turkish with minimal training resources (10 vs. roughly 200 hours).…”
Section: Introductionsupporting
confidence: 87%
“…ATWV on dev and eval-part1 test sets with dense lattices. Note: (2) and (3), as in all above experiments, use the word system (1) as a first stage.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations