Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2021
DOI: 10.18653/v1/2021.naacl-main.300
|View full text |Cite
|
Sign up to set email alerts
|

Topic Model or Topic Twaddle? Re-evaluating Semantic Interpretability Measures

Abstract: When developing topic models, a critical question that should be asked is: How well will this model work in an applied setting? Because standard performance evaluation of topic interpretability uses automated measures modeled on human evaluation tests that are dissimilar to applied usage, these models' generalizability remains in question. In this paper, we probe the issue of validity in topic model evaluation and assess how informative coherence measures are for specialized collections used in an applied sett… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(19 citation statements)
references
References 64 publications
0
19
0
Order By: Relevance
“…This type of post-hoc expert informed inspection builds faith in the face validity of the learned model; however, it is criticized for lacking rigor compared with alternative approaches based on quantitative evaluation metrics. 12,13…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…This type of post-hoc expert informed inspection builds faith in the face validity of the learned model; however, it is criticized for lacking rigor compared with alternative approaches based on quantitative evaluation metrics. 12,13…”
Section: Methodsmentioning
confidence: 99%
“…For many of the learned thematic vectors we observed a strong degree of semantic overlap between the words/tokens (summarizing fitted topical vectors) and the ICD-9 diagnostic codes identified as being most strongly associated with the thematic vector. Subjectively, the following topical vectors demonstrated reasonable convergent/discriminant validity: (21,27,13,18,26,15,47,48,11,50,41,23,39,3,14,32,38,4,5,8,46,25,7,9,45). Below, we identified a subset of thematic vectors for which the words/tokens loading strongly on topical basis appeared semantically associated with assigned primary diagnostic codes, suggesting they may be measuring the same latent construct:…”
Section: Topic Model Summarization and Association Between Learned To...mentioning
confidence: 95%
See 1 more Smart Citation
“…On the other hand, the topics discovered by ETM are more stable but have a lower coherence on average. As already observed in previous work (Al-Sumait et al, 2009;Doogan and Buntine, 2021), obtaining junk or mixed topics is common in topic models and this problem can be addressed by filtering out the topics that are less relevant.…”
Section: Qualitative Resultsmentioning
confidence: 85%
“… 2018 ; Hoyle et al. 2020 ; Doogan and Buntine 2021 ). Recent approaches to modelling short text datasets include the use of auxiliary metadata (Zhao et al.…”
Section: Introduction and Motivationsmentioning
confidence: 99%