Structured Fusion Networks for Dialog

Mehri, Shikib; Srinivasan, Tejas; Eskénazi, Maxine

doi:10.18653/v1/w19-5921

Cited by 88 publications

(107 citation statements)

References 40 publications

Supporting

Mentioning

107

Contrasting

Order By: Relevance

“…However, the recently proposed multi-domain taskoriented dialogue datasets Eric et al, 2019) bring new challenges for multi-domain dialogue state tracking and response generation. Several follow up works ; Budzianowski and Vulić, 2019; Mehri et al, 2019;Madotto et al, 2020b) improved on the initial baselines with various methodologies. proposed the domain aware multi-decoder network and augmented the system act labels by leveraging the user act annotation, achieving the SOTA results in MultiWoz.…”

Section: Related Workmentioning

confidence: 99%

“…For the end-to-end dialogue modeling task, there are three automatic metrics to evaluate the response quality: 1) Inform rate: if the system provides a correct entity, 2) Success rate: if the system provides the correct entity and answers all the requested information, 3) BLEU (Papineni et al, 2002) for measuring the fluency of the generated response. Following previous work (Mehri et al, 2019), we also report the combined score, i.e., Combined = (Inform + Success)×0.5 + BLEU, as an overall quality measure. Joint goal accuracy (Joint Acc.)…”

Section: Evaluation Metricsmentioning

confidence: 99%

“…SFN + RL: a seq2seq network comprised of several pre-trained dialogue modules that are connected through hidden states. Reinforcement fine tuning is used additionally to train the model (Mehri et al, 2019). MD-Sequicity: an extension of the Sequicity (Lei et al, 2018) framework for multi-domain task-oriented dialogue by .…”

Section: End-to-end Modelingmentioning

confidence: 99%

See 2 more Smart Citations

MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems

Lin¹,

Madotto²,

Winata³

et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

View full text Add to dashboard Cite

In this paper, we propose Minimalist Transfer Learning (MinTL) to simplify the system design process of task-oriented dialogue systems and alleviate the over-dependency on annotated data. MinTL is a simple yet effective transfer learning framework, which allows us to plug-and-play pre-trained seq2seq models, and jointly learn dialogue state tracking and dialogue response generation. Unlike previous approaches, which use a copy mechanism to "carryover" the old dialogue states to the new one, we introduce Levenshtein belief spans (Lev), that allows efficient dialogue state tracking with a minimal generation length. We instantiate our learning framework with two pretrained backbones: T5 (Raffel et al., 2019) and BART (Lewis et al., 2019), and evaluate them on MultiWOZ. Extensive experiments demonstrate that: 1) our systems establish new state-of-the-art results on end-to-end response generation, 2) MinTL-based systems are more robust than baseline methods in the low resource setting, and they achieve competitive results with only 20% training data, and 3) Lev greatly improves the inference efficiency 1 .

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Evaluation Metricsmentioning

confidence: 99%

See 1 more Smart Citation

MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems

Lin¹,

Madotto²,

Winata³

et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

View full text Add to dashboard Cite

show abstract

“…End-to-end task-oriented dialog systems. Our model belongs to the family of E2E task-oriented dialog models (Wen et al, 2017a,b;Li et al, 2017;Mehri et al, 2019;Peng et al, 2020;Hosseini-Asl et al, 2020). We borrow some elements from the Sequicity ) model, such as representing the belief state as a natural language sequence (a text span), and using copy-augmented Seq2Seq learning (Gu et al, 2016).…”

Section: Related Workmentioning

confidence: 99%

“…The TSCP , SEDST , FSDM (Shu et al, 2019), MOSS (Liang et al, 2020) and DAMD are based on the copy-augmented Seq2Seq learning framework proposed by . LIDM (Wen et al, 2017a), SFN (Mehri et al, 2019) and UniConv (Le et al, 2020a) are modular designed, connected through neural states and trained end-to-end. SimpleTOD (Hosseini-Asl et al, 2020) and SOLOLIST (Peng et al, 2020) are two recent models, which both use a single autoregressive language model, initialized from GPT-2, to build the entire system.…”

Section: Baselinesmentioning

confidence: 99%

A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning

Zhang¹,

Ou²,

Hu³

et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

View full text Add to dashboard Cite

Structured belief states are crucial for user goal tracking and database query in task-oriented dialog systems. However, training belief trackers often requires expensive turn-level annotations of every user utterance. In this paper we aim at alleviating the reliance on belief state labels in building end-to-end dialog systems, by leveraging unlabeled dialog data towards semi-supervised learning. We propose a probabilistic dialog model, called the LAtent BElief State (LABES) model, where belief states are represented as discrete latent variables and jointly modeled with system responses given user inputs. Such latent variable modeling enables us to develop semi-supervised learning under the principled variational learning framework. Furthermore, we introduce LABES-S2S, which is a copyaugmented Seq2Seq model instantiation of LABES 1 . In supervised experiments, LABES-S2S obtains strong results on three benchmark datasets of different scales. In utilizing unlabeled dialog data, semi-supervised LABES-S2S significantly outperforms both supervisedonly and semi-supervised baselines. Remarkably, we can reduce the annotation demands to 50% without performance loss on MultiWOZ.

show abstract