2015 6th International Conference on Information and Communication Systems (ICICS) 2015
DOI: 10.1109/iacs.2015.7103229
|View full text |Cite
|
Sign up to set email alerts
|

Scalable multi-label Arabic text classification

Abstract: Multi-label text classification (MTC) is a natural extension of the traditional text classification (TC) in which a possibly large set of labels can be assigned to each document. The dimensionality of labels makes MTC difficult and challenging. Several ways are proposed to ease the classification process and one of them is called the problem transformation (PT) method. It is used to transform the multi-labeled data into a singlelabel one that is suitable for normal classification. Our paper presents a detailed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 30 publications
(15 citation statements)
references
References 24 publications
0
15
0
Order By: Relevance
“…The developed model was trained using the standard corpus which is collected in [33], it contains 10,000 Arabic articles written in "modern standard Arabic language (MSA)", where those articles are assigned to five labels: Sports, Economy, Arts, Science, and Politics. In addition, three multi-label evaluation metrics were used to evaluate the given Arabic datasets which are average recall, average precision, and average Fmeasure.…”
Section: B Pt Methods Based On Binary Classificationmentioning
confidence: 99%
See 2 more Smart Citations
“…The developed model was trained using the standard corpus which is collected in [33], it contains 10,000 Arabic articles written in "modern standard Arabic language (MSA)", where those articles are assigned to five labels: Sports, Economy, Arts, Science, and Politics. In addition, three multi-label evaluation metrics were used to evaluate the given Arabic datasets which are average recall, average precision, and average Fmeasure.…”
Section: B Pt Methods Based On Binary Classificationmentioning
confidence: 99%
“…The study conducted in [33] aimed to handle the MLC problem of Arabic language based on transformation to multiclass classification approach using a set of single-label machine learning classifiers. The authors aimed to transform the MLC problem of Arabic data into several single-label classification problems by using MEKA tool to perform PT methods which are: LP, BR, and Ranking and Threshold-based method (RT).…”
Section: Pt Methods Based On Multi-class Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…Kusumaningrum, et al [25], similar in principle to this paper, considered classification of Indonesian news articles using latent Dirichlet allocation. Also [26] dwelt on Arabic text classification. The motivation to explore MNB for the current problem comes from the findings presented by [3].…”
Section: Literature Reviewmentioning
confidence: 99%
“…Arabic is the language of 22 countries with more than 400 million inhabitants. Natural Language Processing for the Arabic language is considered challenging due to the special characteristics of the language such as: orthography, the existence of short vowels, the complex morphology compared to English, the widespread of synonyms and the lack of publicly and freely accessible corpora [29], [35], [10].…”
Section: Introductionmentioning
confidence: 99%