Proceedings of the 9th Linguistic Annotation Workshop 2015
DOI: 10.3115/v1/w15-1614
|View full text |Cite
|
Sign up to set email alerts
|

Correction Annotation for Non-Native Arabic Texts: Guidelines and Corpus

Abstract: We present our correction annotation guidelines to create a manually corrected nonnative (L2) Arabic corpus. We develop our approach by extending an L1 large-scale Arabic corpus and its manual corrections, to include manually corrected non-native Arabic learner essays. Our overarching goal is to use the annotated corpus to develop components for automatic detection and correction of language errors that can be used to help Standard Arabic learners (native and non-native) improve the quality of the Arabic text … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
27
0
2

Year Published

2015
2015
2022
2022

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 24 publications
(30 citation statements)
references
References 13 publications
0
27
0
2
Order By: Relevance
“…The objective of the model is to minimize the error of the network output versus the true sentiment class label for each training case. The remaining settings for DNN model are: (1) conjugate gradient algorithm is used for gradient updates with three line searches; (2) weights are randomly initialized from Gaussian distribution of 0 mean and standard deviation of 1; and (3) the activation function of each neuron is taken as hyperbolic tangent activation. Training is conducted in batches of size 100 cases for 50 epochs.…”
Section: Deep Learning Models For Sentiment Analysis In Arabicmentioning
confidence: 99%
See 4 more Smart Citations
“…The objective of the model is to minimize the error of the network output versus the true sentiment class label for each training case. The remaining settings for DNN model are: (1) conjugate gradient algorithm is used for gradient updates with three line searches; (2) weights are randomly initialized from Gaussian distribution of 0 mean and standard deviation of 1; and (3) the activation function of each neuron is taken as hyperbolic tangent activation. Training is conducted in batches of size 100 cases for 50 epochs.…”
Section: Deep Learning Models For Sentiment Analysis In Arabicmentioning
confidence: 99%
“…2 Table 1 presents a sample Arabic news comment along with its manually corrected form, its romanized transliteration, 3 and the English translation. The errors in the original and the corrected forms are underlined and co-indexed.…”
Section: The Qalb Corpusmentioning
confidence: 99%
See 3 more Smart Citations