2019
DOI: 10.2478/bsrj-2019-0006
|View full text |Cite
|
Sign up to set email alerts
|

Albanian Text Classification: Bag of Words Model and Word Analogies

Abstract: Background: Text classification is a very important task in information retrieval. Its objective is to classify new text documents in a set of predefined classes, using different supervised algorithms. Objectives: We focus on the text classification for Albanian news articles using two approaches. Methods/Approach: In the first approach, the words in a collection are considered as independent components, allocating to each of them a conforming vector in the vector’s space. Here we utilized nine classifiers fro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(7 citation statements)
references
References 11 publications
0
7
0
Order By: Relevance
“…Later on, Kadriu et al [24] presented an extended study concerning text classification for Albanian news articles, which was based on a bag of words model and word analogies. In this study, the authors focused on text classification using two approaches: i) the text classification treats words as independent components and ii) the text classification treats words based on their semantic and syntactic word similarities.…”
Section: Text Classificationmentioning
confidence: 99%
“…Later on, Kadriu et al [24] presented an extended study concerning text classification for Albanian news articles, which was based on a bag of words model and word analogies. In this study, the authors focused on text classification using two approaches: i) the text classification treats words as independent components and ii) the text classification treats words based on their semantic and syntactic word similarities.…”
Section: Text Classificationmentioning
confidence: 99%
“…(1) Based on machine learning (Kadriu et al, 2019;Khairnar et al, 2013;Le et al, 2017) and (2) based on lexicon (Liu, 2012(Liu, , 2017Vu et al, 2017;Li et al, 2019). In addition, to increase the efficiency of the opinion mining method, the research has used a hybrid method combining machine learning and lexicon (Mudambi et al, 2010;Maks et al, 2012;Sun et al, 2017;Yang et al, 2017).…”
Section: Customer Opinion Mining In Online Servicesmentioning
confidence: 99%
“…Due to some characteristics of the language on social networks, such as a limited number of characters or emotions depending heavily on what users are reading and listening to, the emotional classification of users in social networks is a challenging issue. Machine learning has been applied and has achieved some success in sentiment analysis (Khairnar et al, 2013;Kadriu et al, 2019).…”
Section: Machine Learning-based Customer Sentiment Analysismentioning
confidence: 99%
“…Using orange 3, an opensource data mining application [15] this study used the Liu-Hu sentiment analysis method for the following reasons: i) it provides a reasonable accuracy [16] with the dataset of this work; ii) Liu-Hu method in orange is free; iii) Liu-Hu in orange is easy to use for sentiment analysis; iv) it does not need a training dataset to analyze sentiment; and v) proffer a clear view of negative and positive sentiment based on the specific topic. Bag of words model in Spyder is used to classify text and to count word occurrence from social media and to identify the frequently discussed topic for the following reasons: i) bag-of-word is simple to understand; ii) easy to implement; iii) bag-of-word works faster for our dataset; and iv) many scholars prefer bag-of-word for [17] text classification.…”
Section: Introductionmentioning
confidence: 99%