On Learning Prediction-Focused Mixtures

2021 · Preprint · DOI: 10.48550/arxiv.2110.13221

Abstract: Probabilistic models help us encode latent structures that both model the data and, ideally, are useful for specific downstream tasks. Among these, mixture models and their time-series counterparts, hidden Markov models, identify discrete components in the data. In this work, we focus on a constrained capacity setting, where we want to learn a model with relatively few components (e.g. for interpretability purposes). To maintain prediction performance, we introduce prediction-focused modeling for mixtures, …
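The truncated abstract describes learning a small mixture whose components remain useful for a downstream prediction task. Below is a minimal sketch of that idea, assuming the weighted prediction-constrained objective attributed to Hughes et al. (2018) in the citation statement further down; the paper's own prediction-focused formulation may differ, and all names, shapes, and hyperparameters here are illustrative, not taken from the paper.

import torch

torch.manual_seed(0)
K, D = 3, 5                        # few components: the constrained-capacity regime
X = torch.randn(200, D)            # toy inputs
y = (X[:, 0] > 0).float()          # toy binary labels

# Mixture parameters: component weights, means, and per-component label logits.
logits_pi = torch.zeros(K, requires_grad=True)
mu = torch.randn(K, D, requires_grad=True)
eta = torch.zeros(K, requires_grad=True)      # logit of p(y=1 | z=k)

opt = torch.optim.Adam([logits_pi, mu, eta], lr=0.05)
lam = 10.0   # trade-off weight on the prediction term

for _ in range(500):
    log_pi = torch.log_softmax(logits_pi, dim=0)
    # Spherical-Gaussian log-density log N(x | mu_k, I), up to an additive constant.
    log_px_z = -0.5 * ((X[:, None, :] - mu[None]) ** 2).sum(-1)    # (N, K)
    log_joint = log_pi + log_px_z
    log_px = torch.logsumexp(log_joint, dim=1)        # log p(x)
    resp = torch.softmax(log_joint, dim=1)            # responsibilities p(z | x)
    p_y = (resp * torch.sigmoid(eta)).sum(dim=1)      # p(y=1 | x), marginalising z
    log_py = y * torch.log(p_y + 1e-8) + (1 - y) * torch.log(1 - p_y + 1e-8)
    loss = -(log_px + lam * log_py).mean()            # generative fit + weighted prediction
    opt.zero_grad()
    loss.backward()
    opt.step()

Setting lam = 0 recovers plain maximum-likelihood fitting of the mixture; larger values push the few available components to align with boundaries that matter for the labels y.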

Cited by 1 publication (1 citation statement) · References 5 publications

“…The upstream setting allows us to implicitly train our classifier and topic model in a one-stage setting that is end-to-end. This has the benefit of allowing us to tune the trade-off between our classifier and topic model performance in a prediction-constrained framework, which has been shown to achieve better empirical results when latent variable models are used as a dimensionality reduction tool (Hughes et al., 2018; Sharma et al., 2021). Furthermore, the upstream setting allows us to introduce the document label classifier as a latent variable, enabling our model to work in semi-supervised settings.…”
Section: Downstream Supervised Topic Models (mentioning)
Confidence: 99%
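
The trade-off this statement describes can be made concrete. A common weighted-sum form of the prediction-constrained objective of Hughes et al. (2018) is sketched below; the notation is an illustrative reconstruction, not quoted from either paper:

\max_{\theta} \; \sum_{d} \log p(x_d \mid \theta) \;+\; \lambda \sum_{d} \log p(y_d \mid x_d, \theta), \qquad \lambda \ge 1

Raising the multiplier \lambda shifts model capacity from explaining the data x toward predicting the labels y, which is the tuning knob the citing authors refer to when the latent variable model doubles as a dimensionality reduction tool.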