Proceedings of the 17th International Conference on Computational Linguistics - 1998
DOI: 10.3115/980451.980909
|View full text |Cite
|
Sign up to set email alerts
|

Combining stochastic and rule-based methods for disambiguation in agglutinative languages

Abstract: LaburpenaArtikulu honetan metodo estokastiko eta erregeletan oinarritutako metodoen arteko konbinaketa euskarari aplikatzearen emaitzak aurkeztuko ditugu.Desanbiguazioan erabilitako metodoak Murrizpen Gramatika (CG) eta MULTEXT proiektuak garatutako HMMn oinarritutako etiketatzailea dira. Euskara hizkuntza eranskaria izaki, hitz bakoitzari dagozkion irakurketa guztiak esleitzeko analizatzaile morfologikoa beharrezkoa da. Ondoren, CG erregelak informazio morfologiko guztiari aplikatzen zaizkio eta prozesu honek… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0
1

Year Published

2001
2001
2017
2017

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 0 publications
0
14
0
1
Order By: Relevance
“…In the present study, we have chosen a different strategy (similar to the one described for other types of languages in (Tapanainen and Voutilainen, 1994), (Ezeiza et al, 1998) and (Hakkani-Tur et al, 2000)). At the same time, the rulebased component is known to perform well in eliminating the incorrect alternatives 2 , rather than picking the correct one under all circumstances.…”
Section: System Combinationmentioning
confidence: 99%
“…In the present study, we have chosen a different strategy (similar to the one described for other types of languages in (Tapanainen and Voutilainen, 1994), (Ezeiza et al, 1998) and (Hakkani-Tur et al, 2000)). At the same time, the rulebased component is known to perform well in eliminating the incorrect alternatives 2 , rather than picking the correct one under all circumstances.…”
Section: System Combinationmentioning
confidence: 99%
“…With regard to the feature vectors, the computation of the POS information was performed using the Eustagger toolkit [34] and ixa-pipe-pos [35] for the Basque and Spanish languages respectively. In addition, the time-codes at word level were obtained through the audio forced-alignment algorithms presented in [36] for both languages.…”
Section: Basque Corpus Spanish Large Corpusmentioning
confidence: 99%
“…Another work describing the combined approach is [10]. Its authors introduce the results of combining the statistical and deterministic methods on the basis of the Basque language.…”
Section: Rule-based Approachesmentioning
confidence: 99%