2019
DOI: 10.48550/arxiv.1906.01830
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ArSentD-LEV: A Multi-Topic Corpus for Target-based Sentiment Analysis in Arabic Levantine Tweets

Abstract: Sentiment analysis is a highly subjective and challenging task. Its complexity further increases when applied to the Arabic language, mainly because of the large variety of dialects that are unstandardized and widely used in the Web, especially in social media. While many datasets have been released to train sentiment classifiers in Arabic, most of these datasets contain shallow annotation, only marking the sentiment of the text unit, as a word, a sentence or a document. In this paper, we present the Arabic Se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 31 publications
0
10
0
Order By: Relevance
“…evaluate AraBERT on 3 Arabic downstream tasks. These are (1) sentiment analysis from 6 different datasets: HARD (Elnagar et al, 2018), ASTD (Nabil et al, 2015), ArsenTD-Lev (Baly et al, 2019), LABR (Aly and Atiya, 2013), and ArSaS (Elmadany et al, 2018). (2) NER, with the ANERcorp (Columbia University, 2016), and (3) Arabic QA, on Arabic-SQuAD (Mozannar et al, 2019) and ARCD (Mozannar et al, 2019) datasets.…”
Section: Arabic Lmsmentioning
confidence: 99%
See 2 more Smart Citations
“…evaluate AraBERT on 3 Arabic downstream tasks. These are (1) sentiment analysis from 6 different datasets: HARD (Elnagar et al, 2018), ASTD (Nabil et al, 2015), ArsenTD-Lev (Baly et al, 2019), LABR (Aly and Atiya, 2013), and ArSaS (Elmadany et al, 2018). (2) NER, with the ANERcorp (Columbia University, 2016), and (3) Arabic QA, on Arabic-SQuAD (Mozannar et al, 2019) and ARCD (Mozannar et al, 2019) datasets.…”
Section: Arabic Lmsmentioning
confidence: 99%
“…They fine-tune these two versions on 5 sentiment datasets. These are HARD (Elnagar et al, 2018), the balanced data for ASTD (which we will refer to as ASTD-B) (Nabil et al, 2015), ArSenTD-Lev (Baly et al, 2019), AJGT (Alomari et al, 2017), and the unbalanced positive and negative classes for LABR (Aly and Atiya, 2013). split the data into 80/20 for training/test, respectively and report results in accuracy using the best epoch identified on test data.…”
Section: Baselinesmentioning
confidence: 99%
See 1 more Smart Citation
“…Baly et al ( 2019) [8] created: the Multi-Dialect Arabic Sentiment Twitter Dataset, and the Arabic Sentiment Twitter Dataset for the Levantine dialect. The authors experimented with SVM, logistic regression and random forest trees classifiers using POS tags, numbers of positive/negative emoticons and words from different lexicons, Twitter-specific features, etc., in addition to several deep learning models.…”
Section: Heikal Et Al (2018)mentioning
confidence: 99%
“…We collect 15 datasets related to sentiment analysis of Arabic, including MSA and dialects (Abdul-Mageed and Diab, 2012;Abdulla et al, 2013;Abdul-Mageed et al, 2014b;Nabil et al, 2015;Kiritchenko et al, 2016;Aly and Atiya, 2013;Salameh et al, 2015;Rosenthal et al, 2017;Alomari et al, 2017;Mohammad et al, 2018;Baly et al, 2019).…”
Section: Sentimentmentioning
confidence: 99%