Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2015
DOI: 10.3115/v1/n15-1076
|View full text |Cite
|
Sign up to set email alerts
|

Is Your Anchor Going Up or Down? Fast and Accurate Supervised Topic Models

Abstract: Topic models provide insights into document collections, and their supervised extensions also capture associated document-level metadata such as sentiment. However, inferring such models from data is often slow and cannot scale to big data. We build upon the "anchor" method for learning topic models to capture the relationship between metadata and latent topics by extending the vector-space representation of word-cooccurrence to include metadataspecific dimensions. These additional dimensions reveal new anchor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
17
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(17 citation statements)
references
References 28 publications
0
17
0
Order By: Relevance
“…Using an assumption of separability, these anchor words act as high precision markers of particular topics and, thus, help discern the topics from one another. Although the original algorithm proposed by Arora et al (2012), and subsequent improvements to their approach, find these anchor words automatically (Arora et al, 2013;Lee and Mimno, 2014), recent adaptations allow manual insertion of anchor words and other metadata Nguyen et al, 2015). Our work is similar to the latter, where we treat anchor words as fuzzy logic markers and embed them into the topic model in a semi-supervised fashion.…”
Section: Related Workmentioning
confidence: 99%
“…Using an assumption of separability, these anchor words act as high precision markers of particular topics and, thus, help discern the topics from one another. Although the original algorithm proposed by Arora et al (2012), and subsequent improvements to their approach, find these anchor words automatically (Arora et al, 2013;Lee and Mimno, 2014), recent adaptations allow manual insertion of anchor words and other metadata Nguyen et al, 2015). Our work is similar to the latter, where we treat anchor words as fuzzy logic markers and embed them into the topic model in a semi-supervised fashion.…”
Section: Related Workmentioning
confidence: 99%
“…As the acceptance of topic coherence measures increases as a mean of topic model assessment (Paul and Girju, 2010;Reisinger et al, 2010;Hall et al, 2012), recent research trends focus on proposing fast and efficient models that can be scaled up to big amounts of data (Yang et al, 2015;Nguyen et al, 2015), using the whole text per document for training.…”
Section: Related Workmentioning
confidence: 99%
“…We also associate each second-level frame node with an ideal point η k,j ∼ N (0, γ). This resembles how supervised topic models (Blei and McAuliffe, 2007;Nguyen et al, 2015) discover polarized topics' associated response variables.…”
Section: For Each Billmentioning
confidence: 99%