2017
DOI: 10.1002/asi.23822
|View full text |Cite
|
Sign up to set email alerts
|

A local context‐aware LDA model for topic modeling in a document network

Abstract: With the rapid development of the Internet and its applications, growing volumes of documents increasingly become interconnected to form large‐scale document networks. Accordingly, topic modeling in a network of documents has been attracting continuous research attention. Most of the existing network‐based topic models assume that topics in a document are influenced by its directly linked neighbouring documents in a document network and overlook the potential influence from indirectly linked ones. The existing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 42 publications
0
6
0
Order By: Relevance
“…Network clustering is an unsupervised approach to clustering network objects with affinity [4,24,25,26,27]; document networks or social networks are often clustered to analyze datasets without expending much human effort. Network-based clustering is different from contentbased clustering in that it clusters documents based on connectivity.…”
Section: Network-based Clusteringmentioning
confidence: 99%
“…Network clustering is an unsupervised approach to clustering network objects with affinity [4,24,25,26,27]; document networks or social networks are often clustered to analyze datasets without expending much human effort. Network-based clustering is different from contentbased clustering in that it clusters documents based on connectivity.…”
Section: Network-based Clusteringmentioning
confidence: 99%
“…It is used to calculate the distances between documents and subsequently to cluster similar documents during documentclustering tasks. Some widely used document-embedding methods include those that use the term frequency (TF) or the inverse document frequency (IDF) [22], as well as the topic-modeling-based method called latent Dirichlet allocation (LDA) [6], [14], [19], [23]. Examples of studies in which these methods have been used for document clustering include one in which document clustering was performed by applying topic-modeling-based document embedding to the k-means algorithm [6].…”
Section: B Document-embedding-based Document Clusteringmentioning
confidence: 99%
“…Examples of studies in which these methods have been used for document clustering include one in which document clustering was performed by applying topic-modeling-based document embedding to the k-means algorithm [6]. In another study [14], the performance of topic modeling was improved by measuring the significance of documents based on their network; the improved method was then used for document clustering. In addition, research is currently underway on embedding documents or words using neural networks.…”
Section: B Document-embedding-based Document Clusteringmentioning
confidence: 99%
See 1 more Smart Citation
“…Song et al proposed a proximity probabilistic model (PPM), which uses a position‐dependent term count to represent both the number of occurrences of a term and the term counts propagated from other terms. Quite recently, topic models have been widely used to explore latent term associations in knowledge discovery and other related areas . Liu and Croft in particular proposed cluster‐based retrieval models under the language modeling framework, which has been used to smooth the probabilities in the document model.…”
Section: Related Workmentioning
confidence: 99%