Proceedings of the First Workshop on Language Technologies for African Languages - AfLaT '09 2009
DOI: 10.3115/1564508.1564527
|View full text |Cite
|
Sign up to set email alerts
|

Methods for Amharic part-of-speech tagging

Abstract: The paper describes a set of experiments involving the application of three state-ofthe-art part-of-speech taggers to Ethiopian Amharic, using three different tagsets. The taggers showed worse performance than previously reported results for English, in particular having problems with unknown words. The best results were obtained using a Maximum Entropy approach, while HMM-based and SVMbased taggers got comparable results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(22 citation statements)
references
References 16 publications
0
22
0
Order By: Relevance
“…Alongside these hubs, a large number of individual researchers is working on African Language Technology, not only in Africa (Abdillahi et al 2007;Muhirwe 2007), but in the Western world as well (Hurskainen 1992;Gambäck et al 2009;Gasser 2010;Dione et al 2010;Shah et al 2010). It falls beyond the scope of this overview to introduce all of the individual researchers in the field, but we would like to refer to AfLaT.org for a fairly exhaustive bibliography on African Language Technology.…”
Section: A Brief Overview Of African Language Technologymentioning
confidence: 97%
“…Alongside these hubs, a large number of individual researchers is working on African Language Technology, not only in Africa (Abdillahi et al 2007;Muhirwe 2007), but in the Western world as well (Hurskainen 1992;Gambäck et al 2009;Gasser 2010;Dione et al 2010;Shah et al 2010). It falls beyond the scope of this overview to introduce all of the individual researchers in the field, but we would like to refer to AfLaT.org for a fairly exhaustive bibliography on African Language Technology.…”
Section: A Brief Overview Of African Language Technologymentioning
confidence: 97%
“…This is mainly due: 1) to the absence of linguistical resources; 2) proprietary resources; 3) resources not in the electronic format. The most researched Ethiopic language is Amharic [1,2,9,11,13,23], which is also supported by Google. For the other languages as, e.g., Tigrinya, the research on different NLP topics are in their early stages (e.g., statistical machine translation from English to Tigrinya [3]).…”
Section: Related Workmentioning
confidence: 99%
“…A recent work investigated modeling POS tagging as an optimization problem using the genetic algorithm [1]. POS-tagging research for Amharic languages was mostly driven by a 210K-word news corpus that was tagged with 30 POS tags [6] Using this corpus, [8] reported various experiments applying Trigram'n'Tags (TnT), SVM, and maximum entropy algorithms. Furthermore, [10] identified inflectional and derivational patterns of Amharic words as features and improved tagging accuracy up to 90.95% using CRFs.…”
Section: Related Workmentioning
confidence: 99%
“…CRFs also solve the bias label problem that exists in maximum entropy Markov models (MEMMs) by training to predict the whole sequence correct instead of training to predict each label independently [12]. In the second approach, POS tagging is modeled as a classification problem and uses an SVM classifier [4] in a one-versus-therest multiclass scheme 8 . Both approaches are among the methods that have been successfully applied to develop stateof-the-art POS taggers 9 .…”
Section: Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation