Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP) 2014
DOI: 10.3115/v1/w14-3617
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Correction of Arabic Text: a Cascaded Approach

Abstract: This paper describes the error correction model that we used for the Automatic Correction of Arabic Text shared task. We employed two correction models, namely a character-level model and a casespecific model, and two punctuation recovery models, namely a simple statistical model and a CRF model. Our results on the development set suggest that using a cascaded correction model yields the best results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
15
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(16 citation statements)
references
References 7 publications
1
15
0
Order By: Relevance
“…In The companion volume of the proceedings of international joint conference on natural language processing (ijcnlp) (pp. [13][14][15][16]. …”
Section: Referencesmentioning
confidence: 99%
See 1 more Smart Citation
“…In The companion volume of the proceedings of international joint conference on natural language processing (ijcnlp) (pp. [13][14][15][16]. …”
Section: Referencesmentioning
confidence: 99%
“…Techniques and tools reported in the literature for supporting the Arabic spelling errors detection and correction task include morphological analysis [12] [16], finite state transducer with edit distance [9] [8], statistical character level transformation [14], N-gram scores [17] [8], conditional random fields [14] [8], and Naïve Base [15]. Similar to systems described in the literature, Arib utilizes language resources such as dictionaries and corpora as well as the application of different techniques to support the task of Arabic spelling error detection and correction.…”
Section: Related Workmentioning
confidence: 99%
“…Significant contributions were also introduced in the 2014 Shared Task on Arabic Error Correction including Nawar and Ragheb, 2014;Jeblee et al, 2014;and Mubarak and Darwish, 2014).…”
Section: Introductionmentioning
confidence: 99%
“…Team Name Affiliation CLMB Columbia University (USA) CMUQ (Jeblee et al, 2014) Carnegie Mellon University in Qatar (Qatar) CP13 Université Paris 13 (France) CUFE (Nawar and Ragheb, 2014) Computer Engineering Department, Cairo University (Egypt) GLTW (Zerrouki et al, 2014) Bouira University (Algeria), The National Computer Science Engineering School (Algeria), and Tabuk University (KSA) GWU (Attia et al, 2014) George Washington University (USA) QCRI (Mubarak and Darwish, 2014) Qatar Computing Research Institute (Qatar) TECH (Mostefa et al, 2014) Techlimed.com (France) YAM (Hassan et al, 2014) Faculty of Engineering, Cairo University (Egypt) Table 6: Approaches adopted by the participating teams.…”
Section: Shared Task Datamentioning
confidence: 99%