2018
DOI: 10.14778/3192965.3192972
|View full text |Cite
|
Sign up to set email alerts
|

Scalable training of hierarchical topic models

Abstract: Large-scale topic models serve as basic tools for feature extraction and dimensionality reduction in many practical applications. As a natural extension of flat topic models, hierarchical topic models (HTMs) are able to learn topics of different levels of abstraction, which lead to deeper understanding and better generalization than their flat counterparts. However, existing scalable systems for flat topic models cannot handle HTMs, due to their complicated data structures such as trees and concurrent dynamica… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
21
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(21 citation statements)
references
References 29 publications
0
21
0
Order By: Relevance
“…The most commonly used topic model is LDA [3], [13], [14], which is represented as a probabilistic graphical model in Figure 1, and it has shown great success in various NLP tasks for discovering latent topics in the biomedical literature [15], [16]. However, topics are naturally organized in a hierarchy [4], and the above models are flat topic models that cannot capture the hierarchical information of the topics.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The most commonly used topic model is LDA [3], [13], [14], which is represented as a probabilistic graphical model in Figure 1, and it has shown great success in various NLP tasks for discovering latent topics in the biomedical literature [15], [16]. However, topics are naturally organized in a hierarchy [4], and the above models are flat topic models that cannot capture the hierarchical information of the topics.…”
Section: Related Workmentioning
confidence: 99%
“…These models build a topic hierarchy that allows for arbitrarily large branch structures and adaptive dataset growth. In addition, they have been successfully applied for document modeling, online advertising and microblog location prediction and outperformed flat topic models [4].…”
Section: Introductionmentioning
confidence: 99%
“…Pachinko-allocation and hLDA are both focusing on the logic of topic hierarchy in real case. For example, the nested Chinese Restaurant Process used in hLDA refers to a topic model structure which is very close to the real case [13,42,43]. By taking the root node as the door of a Chinese restaurant, and its child nodes as tables, we may take each document as the customer, and figure out the path it follows.…”
Section: Relevant Research Of Topic Modelingmentioning
confidence: 99%
“…43.675232, -70.321317):3The final structure of the hierarchical range tree of the toy dataset is shown inFig. Anexample of 2-branching hierarchical tree with sub range trees as leaf nodes for the toy dataset…”
mentioning
confidence: 99%
“…Recentelly, researchers have proposed new approaches to effectively extract the hierarchical structure from text [94,17,112,22]. For example, Kim [47] and Titov [90] studied the problem by proposing a model that discovers a hierarchy from review data.…”
Section: Introductionmentioning
confidence: 99%