2017
DOI: 10.1016/j.eswa.2017.03.020
|View full text |Cite
|
Sign up to set email alerts
|

Text classification method based on self-training and LDA topic models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
67
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 184 publications
(68 citation statements)
references
References 26 publications
0
67
0
1
Order By: Relevance
“…Based on the study carried out for this article, it has been found that most widely used text classification techniques follow semi-supervised learning approach (Deshmukh & Tripathy, 2017;Pavlinek & Podgorelec, 2017;W. Zhang, Tang, & Yoshida, 2015).…”
Section: Resultsmentioning
confidence: 99%
“…Based on the study carried out for this article, it has been found that most widely used text classification techniques follow semi-supervised learning approach (Deshmukh & Tripathy, 2017;Pavlinek & Podgorelec, 2017;W. Zhang, Tang, & Yoshida, 2015).…”
Section: Resultsmentioning
confidence: 99%
“…ABOUT HERE According to Guerreiro et al (2016), the ideal number of clusters/topics is attained when the variability explained does not change significantly by adding more clusters. A small number of topics produce topics that are too general, while a big number of topics may reduce the interpretability of the results (Pavlinek & Podgorelec, 2017). The number of optimal topics (K) was selected when there was a more negative likelihood after the model first stabilized its variability and just before the log-likelihood started to increase again.…”
Section: Insert Figure 4 About Herementioning
confidence: 99%
“…The topic modelling technique known as LDA is used in [9,7] and considers two main concepts: 1) a single document can have several latent topics and 2) each topic can be drawn as a probability distribution of words (documents are represented as vectors of topics instead of bags of words). A supervised variant of LDA (sLDA) is proposed in [10], which incorporates a response variable (or class) when calculating the model of topics.…”
Section: Related Workmentioning
confidence: 99%
“…Additionally, topic modelling has arisen as a promising alternative to the existing methods, by achieving good results in documents classification using probability distributions similarity, dimensional reduction and incorporating topics semantic [7,8,9]. Latent Dirichlet Allocation (LDA) is one of the most popular techniques in this area, using both unsupervised and supervised learning [10].…”
Section: Introductionmentioning
confidence: 99%