2017
DOI: 10.1007/978-3-319-56608-5_6
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Re-estimation of Topic Models for Measuring Topical Diversity

Abstract: A high degree of topical diversity is often considered to be an important characteristic of interesting text documents. A recent proposal for measuring topical diversity identifies three elements for assessing diversity: words, topics, and documents as collections of words. Topic models play a central role in this approach. Using standard topic models for measuring diversity of documents is suboptimal due to generality and impurity. General topics only include common information from a background corpus and ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 16 publications
0
8
0
Order By: Relevance
“…We relied on standard LDA for building our topic models. However, very recently, Azarbonyad et al (2017) have proposed that hierarchical re-estimation of topic models can lead to better estimates of topic diversity. While they are interested in calculating the diversity within a document, we are more interested in differences between documents, future research should evaluate in how far their approach could improve ours.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…We relied on standard LDA for building our topic models. However, very recently, Azarbonyad et al (2017) have proposed that hierarchical re-estimation of topic models can lead to better estimates of topic diversity. While they are interested in calculating the diversity within a document, we are more interested in differences between documents, future research should evaluate in how far their approach could improve ours.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…Instead of formulating user intents for a query as a flat list of subtopics, Hu et al (2015) presented hierarchical diversification models. Hierarchical topic models are estimated for measuring topical diversity of documents in Azarbonyad et al (2017). Wang et al (2016a) investigated methods of evaluating search result diversity using intent hierarchies Result diversification has also been investigated in various specific search applications such as image retrieval (Ionescu et al, 2016), historic entity or event search (Gupta and Berberich, 2016), medical records retrieval (Li et al, 2015), music recommendations (Schedl and Hauger, 2015), search in Twitter (Wang et al, 2016b), among others.…”
Section: Results Diversificationmentioning
confidence: 99%
“…In recommender systems and search engine retrieval, diversified results are considered more informative and useful to help consumer mitigate information overload than highly homogeneous results (Boim et al, 2011;Drosou & Pitoura, 2012;Hurley & Zhang, 2011). In the text mining area, several studies have proposed approaches to measure topical diversity of documents, which indicates how diverse a text document is relative to other documents in a corpus (Azarbonyad et al, 2017;Bache et al, 2013;Derzinski & Rohanimanesh, 2014;Liang et al, 2014). Most of these approaches first extract topics from documents using topic modeling techniques and then estimate the diversity of documents using the extracted topics.…”
Section: Measuring Text Diversitymentioning
confidence: 99%