2011
DOI: 10.1002/asi.21598

OCA: Opinion corpus for Arabic

Abstract: Sentiment analysis is a challenging new task related to text mining and natural language processing. Although there are, at present, several studies related to this theme, most of these focus mainly on English texts. The resources available for opinion mining (OM) in other languages are still limited. In this article, we present a new Arabic corpus for the OM task that has been made available to the scientific community for research purposes. The corpus contains 500 movie reviews collected from different web p…

Cited by 218 publications (162 citation statements)
References 26 publications
“…SVM has outperformed other machine-learning techniques because of its primary advantages. For instance, generality of text categorisation dilemmas are linearly separable, it's robust in high-dimensional spaces and powerful when there is a sparse set of samples and any feature is pertinent (Rushdi-Saleh et al, 2011). The fundamental conception of SVM is to find a hyper-plane represented by a vector that does not only dissever the document vectors into a different class from those in other documents.…”
Section: Classification Methods
Citation type: mentioning
confidence: 99%
“…It has outperformed other machine learning techniques due to the associated advantage. For instance, the powerful in high-dimensional spaces [19].…”
Section: Classification Methods
Citation type: mentioning
confidence: 99%
“…It is collected from three different resources: Penn Arabic Treebank (PATB) which is a collection of news wire topics of different domains, Wikipedia user talk pages and a user conversation on web forum sites. Opinion corpus for Arabic (OCA) is built by Rushdi-Saleh [22]. The data is collected from several blogs of movie reviews, obtaining a total of 500 comments (250 positive and 250 negative).…”
Section: Related Work
Citation type: mentioning
confidence: 99%