2003
DOI: 10.1007/978-3-540-24586-5_29
|View full text |Cite
|
Sign up to set email alerts
|

Selection of Lexical Units for Continuous Speech Recognition of Basque

Abstract: The selection of appropriate Lexical Units (LUs) is an important issue in the development of Continuous Speech Recognition (CSR) systems. Words have been used classically as the recognition unit in most of them. However, proposals of non-word units are beginning to arise. Basque is an agglutinative language with some structure inside words, for which non-word morpheme like units could be an appropriate choice. In this work a statistical analysis of units obtained after morphological segmentation has been carri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2003
2003
2010
2010

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 6 publications
0
11
0
Order By: Relevance
“…Table 1 shows the main features of the three textual samples relating to size, number of words and pseudo-morphemes and vocabulary size, both in words and pseudomorphemes for each database [6]. Figure 1 shows some of the interesting conclusions derived of this analysis.…”
Section: Morphological Features Of Basquementioning
confidence: 93%
See 2 more Smart Citations
“…Table 1 shows the main features of the three textual samples relating to size, number of words and pseudo-morphemes and vocabulary size, both in words and pseudomorphemes for each database [6]. Figure 1 shows some of the interesting conclusions derived of this analysis.…”
Section: Morphological Features Of Basquementioning
confidence: 93%
“…This approach has been evaluated over three textual samples analysing both the coverage and the Out of Vocabulary rate, when we use words and pseudo-morphemes obtained by the automatic morphological segmentation tool AHOZATI [6]. Table 1 shows the main features of the three textual samples relating to size, number of words and pseudo-morphemes and vocabulary size, both in words and pseudomorphemes for each database [6].…”
Section: Morphological Features Of Basquementioning
confidence: 99%
See 1 more Smart Citation
“…Then, the task underwent an automatic morphological segmentation and we created two sets of lexical units as alternative to the words. We considered these new lexical units because Basque is an aglutinative language [9]. Thus, MLA task reduces the vocabulary size to 35 pseudo-morphemes (PS-MORPHS).…”
Section: Experimental Evaluationmentioning
confidence: 99%
“…Thus, MLA task reduces the vocabulary size to 35 pseudo-morphemes (PS-MORPHS). Finally, N-WORDS acoustically more robust units [9] were obtained resulting in, 40. The sentences of MLA task were divided into 14,500 sentences for training and 500 for test.…”
Section: Experimental Evaluationmentioning
confidence: 99%