Variational Learning for Unsupervised Knowledge Grounded Dialogs

Mishra, Mayank; Madan, Dhiraj; Pandey, Gaurav; Contractor, Danish

doi:10.48550/arxiv.2112.00653

Cited by 1 publication

(1 citation statement)

References 18 publications

(40 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Retrieval based systems for dialog models have been applied in a variety of settings. Existing work has studied the problem of grounding responses in external knowledge such as documents [22,29,35], structured knowledge [30,37], with varying degrees of knowledge-level supervision [36,37]. In such cases, a knowledge instance is first fetched and then a response is generated.…”

Section: Response Retrieval Systemsmentioning

confidence: 99%

Mix-and-Match: Scalable Dialog Response Retrieval using Gaussian Mixture Embeddings

Pandey¹,

Contractor²,

Joshi³

2022

Preprint

Self Cite

View full text Add to dashboard Cite

Embedding-based approaches for dialog response retrieval embed the context-response pairs as points in the embedding space. These approaches are scalable, but fail to account for the complex, manto-many relationships that exist between context-response pairs. On the other end of the spectrum, there are approaches that feed the context-response pairs jointly through multiple layers of neural networks. These approaches can model the complex relationships between context-response pairs, but fail to scale when the set of responses is moderately large (>100). In this paper, we combine the best of both worlds by proposing a scalable model that can learn complex relationships between context-response pairs. Specifically, the model maps the contexts as well as responses to probability distributions over the embedding space. We train the models by optimizing the Kullback-Leibler divergence between the distributions induced by context-response pairs in the training data. We show that the resultant model achieves better performance as compared to other embedding-based approaches on publicly available conversation data.

show abstract