“…We observe that neural interpretation approaches fall within several broad categories: visualizations and heatmaps (Karpathy et al., 2015; Strobelt et al., 2016), gradient-based analyses (Potapenko et al., 2017; Samek et al., 2017b; Bach et al., 2015; Arras et al., 2017), learning disentangled representations during training (Whitney, 2016; Siddharth et al., 2017; Esmaeili et al., 2018), and model probes (Shi et al., 2016a; Adi et al., 2016; Conneau et al., 2018; Zhu et al., 2018; Kuncoro et al., 2018; Khandelwal et al., 2018). Our work uses linear probes to identify the function of groups of neurons that are correlated with linguistic and task-level features, rather than to interpret individual neurons.…”
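
To make the linear-probe idea concrete, here is a minimal sketch of probing frozen activations for a feature. It is illustrative only, not the paper's setup: the `hidden_states` matrix, the synthetic binary feature, and the use of scikit-learn's `LogisticRegression` are all assumptions introduced here.

```python
# Minimal linear-probe sketch (illustrative; not the paper's exact method).
# Assumes `hidden_states` is an (n_examples, n_neurons) array of frozen model
# activations and `labels` holds one linguistic/task-level feature per example.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
hidden_states = rng.normal(size=(1000, 256))                  # stand-in for real activations
labels = (hidden_states[:, :8].sum(axis=1) > 0).astype(int)   # synthetic binary feature

X_train, X_test, y_train, y_test = train_test_split(
    hidden_states, labels, test_size=0.2, random_state=0
)

# Fit a linear classifier on the frozen activations; held-out probe accuracy
# indicates how linearly decodable the feature is from the representation.
probe = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(f"probe accuracy: {probe.score(X_test, y_test):.3f}")

# Large-magnitude probe weights flag the group of neurons most correlated
# with the feature, i.e. probing neuron groups rather than single units.
top_neurons = np.argsort(-np.abs(probe.coef_[0]))[:10]
print("top contributing neurons:", top_neurons)
```

Because the probe is linear, its weight vector doubles as a simple attribution over neurons, which is what distinguishes this use (locating correlated neuron groups) from per-neuron interpretation.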