2009
DOI: 10.2991/ijcis.2009.2.4.8
|View full text |Cite
|
Sign up to set email alerts
|

Text Categorization Based on Topic Model

Abstract: In the text literature, many topic models were proposed to represent documents and words as topics or latent topics in order to process text effectively and accurately. In this paper, we propose LDACLM or Latent Dirichlet Allocation Category Language Model for text categorization and estimate parameters of models by variational inference. As a variant of Latent Dirichlet Allocation Model, LDACLM regards documents of category as Language Model and uses variational parameters to estimate maximum a posteriori of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 3 publications
0
1
0
Order By: Relevance
“…Documents are represented as a mixture of topic distributions and topics as a mixture of words distributions. The representations are inferred from a large training corpus, and when used for text categorization, information about the text categories is not taken into account (e.g., [14,52,55,56]). Interestingly, Siefkes et al [45] perform spam filtering based on orthogonal sparse bigrams.…”
Section: Related Workmentioning
confidence: 99%
“…Documents are represented as a mixture of topic distributions and topics as a mixture of words distributions. The representations are inferred from a large training corpus, and when used for text categorization, information about the text categories is not taken into account (e.g., [14,52,55,56]). Interestingly, Siefkes et al [45] perform spam filtering based on orthogonal sparse bigrams.…”
Section: Related Workmentioning
confidence: 99%