Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication 2014
DOI: 10.1145/2557977.2558028
|View full text |Cite
|
Sign up to set email alerts
|

Computing paper similarity based on latent dirichlet allocation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(6 citation statements)
references
References 16 publications
0
6
0
Order By: Relevance
“…The topic modelling technique known as LDA is used in [9,7] and considers two main concepts: 1) a single document can have several latent topics and 2) each topic can be drawn as a probability distribution of words (documents are represented as vectors of topics instead of bags of words). A supervised variant of LDA (sLDA) is proposed in [10], which incorporates a response variable (or class) when calculating the model of topics.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…The topic modelling technique known as LDA is used in [9,7] and considers two main concepts: 1) a single document can have several latent topics and 2) each topic can be drawn as a probability distribution of words (documents are represented as vectors of topics instead of bags of words). A supervised variant of LDA (sLDA) is proposed in [10], which incorporates a response variable (or class) when calculating the model of topics.…”
Section: Related Workmentioning
confidence: 99%
“…Regarding text distance metrics the literature is very extensive, being the Cosine similarity one of the most used techniques, like in [17,6]. Authors in [7] combine LDA representation with the Cosine function to calculate similarity among publications using the text of the title, abstract and author names. Other metrics used for both classification and grouping of text documents include simpler vector functions (Manhattan and Euclidean), set-based metrics (Jaccard) and entropy measure in probability distributions (KLD) [17,6,3].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations