2017
DOI: 10.1007/s10994-017-5689-6
|View full text |Cite
|
Sign up to set email alerts
|

Online multi-label dependency topic models for text classification

Abstract: Multi-label text classification is an increasingly important field as large amounts of text data are available and extracting relevant information is important in many application contexts. Probabilistic generative models are the basis of a number of popular text mining methods such as Naive Bayes or Latent Dirichlet Allocation. However, Bayesian models for multi-label text classification often are overly complicated to account for label dependencies and skewed label frequencies while at the same time preventi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
30
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 50 publications
(31 citation statements)
references
References 22 publications
0
30
0
1
Order By: Relevance
“…There are many studies trying to exploit the label correlations for enhancing the multilabel learning (Zhu, Kwok, and Zhou 2017;Huang, Yu, and Zhou 2012). Some of them focus on pairwise correlation (Li, Song, and Luo 2017), while some others consider high order correlation among all labels (Burkhardt and Kramer 2018;Read et al 2011).…”
Section: Related Workmentioning
confidence: 99%
“…There are many studies trying to exploit the label correlations for enhancing the multilabel learning (Zhu, Kwok, and Zhou 2017;Huang, Yu, and Zhou 2012). Some of them focus on pairwise correlation (Li, Song, and Luo 2017), while some others consider high order correlation among all labels (Burkhardt and Kramer 2018;Read et al 2011).…”
Section: Related Workmentioning
confidence: 99%
“…Although be used to some extent, the label correlations cannot be confined to pairwise labels under realworld scenarios. The high-order method consider the correlations among label subsets or the whole class labels [18]- [20]. High-order methods could address more realistic label correlations, while it may suffer from high model complexities.…”
Section: Related Workmentioning
confidence: 99%
“…Since SLC is merely a special case, MLC deals with a more difficult and general problem in the data mining domain, focusing on the following two challenges. (1) The number of label sets may be very large for test instances (e.g. being exponentially proportional to the total number of labels).…”
Section: Introductionmentioning
confidence: 99%
“…However, little progress has been made with these methods and algorithms [15][16][17][18][19][20][21][22][23][24][25][26][27][28][29] especially in the following aspects. (1) Complexity: in order to construct a graph model, a graph-based MLC method must map the instances of an entire training set to graph vertices, with many instances irrelevant to the test instances causing very high requirements for time and space and causing high computational complexity. (2) Heterogeneity: for similarity measurement among instances, nearly none of these methods consider the difference between discrete features and continuous features.…”
Section: Introductionmentioning
confidence: 99%