2020
DOI: 10.1007/s13278-020-00688-x
|View full text |Cite
|
Sign up to set email alerts
|

ArAutoSenti: automatic annotation and new tendencies for sentiment classification of Arabic messages

Abstract: A corpus-based sentiment analysis approach for messages written in Arabic and its dialects is presented and implemented. The originality of this approach resides in the automation construction of the annotated sentiment corpus, which relies mainly on a sentiment lexicon that is also constructed automatically. For the classification step, shallow and deep classifiers are used with features being extracted applying word embedding models. For the validation of the constructed corpus, we proceed with a manual revi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 91 publications
0
4
0
Order By: Relevance
“…This strategy was effective in enhancing the training evaluation and final predictions, resulting in an F1-score of 82%. In [15], this work presents a lexicon-based approach to build an annotated corpus for classifying the emotional state of Arabic messages using an Algerian Facebook dataset with 3048 messages (1488 positive and 1560 negative). The authors developed an algorithm that considers the morphology of Arabic and its dialects, along with a lexicon that was automatically made by using an existing English mood lexicon.…”
Section: Related Workmentioning
confidence: 99%
“…This strategy was effective in enhancing the training evaluation and final predictions, resulting in an F1-score of 82%. In [15], this work presents a lexicon-based approach to build an annotated corpus for classifying the emotional state of Arabic messages using an Algerian Facebook dataset with 3048 messages (1488 positive and 1560 negative). The authors developed an algorithm that considers the morphology of Arabic and its dialects, along with a lexicon that was automatically made by using an existing English mood lexicon.…”
Section: Related Workmentioning
confidence: 99%
“…More recently, the study of [27] presented and tested an approach for automation building of the annotated sentiment corpus of messages written in Arabic and its dialects. They also summarized some sentiment analysis articles' result for Modern Standard Arabic (MSA).…”
Section: Arabic Sentiment Analysis Challengesmentioning
confidence: 99%
“…During the last decade, the computer science field witnessed a considerable resort to word embedding models Arora and Kansal (2019), Tshimula et al (2020), Guellil et al (2020), Kejriwal and Zhou (2020).The principle of these models (also referred to as distributed word representation) is to map related words to nearby points in the space given a corpus of relationships. That is to say, words occurring in similar contexts have similar vector representations and geometric distances between them reflect the degree of their relationships.…”
Section: Word Embedding Modelsmentioning
confidence: 99%