2007
DOI: 10.1007/978-3-540-74999-8_5
|View full text |Cite
|
Sign up to set email alerts
|

Amharic-English Information Retrieval

Abstract: We describe Amharic-English cross lingual information retrieval experiments in the adhoc bilingual tracs of the CLEF 2006. The query analysis is supported by morphological analysis and part of speech tagging while we used different machine readable dictionaries for term lookup in the translation process. Out of dictionary terms were handled using fuzzy matching and Lucene[4] was used for indexing and searching. Four experiments that differed in terms of utilized fields in the topic set, fuzzy matching, and ter… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2009
2009
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…LM is a popular model for the development of IR systems, but it has not been used in previous Amharic IR ones. Many of them are rather based on vector space model [13,20,32,34]. Here, we investigated the effect of LM model on Amharic IR and compared it with BM25.…”
Section: Resultsmentioning
confidence: 99%
“…LM is a popular model for the development of IR systems, but it has not been used in previous Amharic IR ones. Many of them are rather based on vector space model [13,20,32,34]. Here, we investigated the effect of LM model on Amharic IR and compared it with BM25.…”
Section: Resultsmentioning
confidence: 99%
“…The average precisions of 0.3615 and 0.4009 were achieved using IDF and stopword list, respectively. Argaw et al [46] also have developed dictionary-based Amharic-French IR system with and without word sense discrimination using bag-of-words approach. Word sense discrimination refers to the distinction between different word meanings without sense assignment [47].…”
Section: Related Workmentioning
confidence: 99%
“…Few Amharic IR systems were developed in the past. The majority of them are based on stems [26,50,55]; some others are based on citation form [46] and roots [25]. However, due to the complexity of the language, the stem-based, and citation form, and n-gram models do not work well.…”
Section: Comparison With Previous Amharic Ir Studiesmentioning
confidence: 99%
“…As a result, only few Amharic NLP tools have been developed thus far using rule-based and machine learning approaches. Among the available tools are morphological analyzers [Argaw & Asker, 2006;Gasser, 2011;Abate & Assabie, 2014], stemmer [Alemayehu & Willett, 2003], and parser [Argaw & Asker, 2006;Tachbelie et al, 2011]. However, the development of these tools is at prototype stages.…”
Section: Resourcesmentioning
confidence: 99%