2007
DOI: 10.1007/978-3-540-70939-8_13
|View full text |Cite
|
Sign up to set email alerts
|

ANERsys: An Arabic Named Entity Recognition System Based on Maximum Entropy

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
83
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 130 publications
(84 citation statements)
references
References 13 publications
1
83
0
Order By: Relevance
“…In (Nadeau and Sekine, 2007) Although rule based approaches proved successful to some extent, most recent NER research focuses on Statistical Learning techniques due to the shortcomings of rule based approaches in terms of coverage and robustness (Nadeau and Sekine, 2007). For example, (Benajiba et al, 2007) proposes an MSA NER system (ANERsys) based on n-grams and maximum entropy. The authors also introduce ANERCorp corpora and ANERGazet gazetteers.…”
Section: Related Workmentioning
confidence: 99%
“…In (Nadeau and Sekine, 2007) Although rule based approaches proved successful to some extent, most recent NER research focuses on Statistical Learning techniques due to the shortcomings of rule based approaches in terms of coverage and robustness (Nadeau and Sekine, 2007). For example, (Benajiba et al, 2007) proposes an MSA NER system (ANERsys) based on n-grams and maximum entropy. The authors also introduce ANERCorp corpora and ANERGazet gazetteers.…”
Section: Related Workmentioning
confidence: 99%
“…The Arabic Khoja's stemmer [23] was adopted for keywords stemming. The system was tested thoroughly to insure correctness of obtained results and then an extensive evaluation have been carried out to compare JAWEB performance with a well established web-based QA system that supports Arabic Language, Ask.com [10], as described in the following section.…”
Section: System Implementationmentioning
confidence: 99%
“…So it assumes that these names are found in a space of 10 words to the left and 10 words to the right of the trigger word. ANERsys which is a NER system built exclusively for Arabic texts based-on n-grams and maximum entropy approach is presented in Benajiba et al (2007). They developed their own corpora and gazetteers to train, evaluate and boost the system.…”
Section: Related Workmentioning
confidence: 99%
“…They obtained an improvement with respect to a baseline results without using any POS-tag information or text segmentation. In Benajiba et al (2010) the authors achieve a significant, highperformance Arabic NER system by using lexical, syntactic and morphological features. In Shaalan and Raza (2008) a system for Arabic NER is presented which consists of two main processing resources: a dictionary of names (whitelist or gazetteer), and a grammar, in the form of regular rules to recognize the Named entities.…”
Section: Related Workmentioning
confidence: 99%