Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation

Chen, Xiuyi; Meng, Fandong; Li, Peng; Chen, Feilong; Xu, Shuang; Xu, Bo; Zhou, Jie

doi:10.18653/v1/2020.emnlp-main.275

Cited by 66 publications

(57 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Automatic Evaluation. We automatically evaluate knowledge selection with accuracy (Acc), response generation with perplexity (PPL), unigram F1 (R-1) and bigram F1 (R-2), which are commonly used in this task (Dinan et al, 2019;Kim et al, 2020a;Chen et al, 2020b). We also remove all the punctuation and (a, an, the) to compute the R-1 and R-2 scores as (Kim et al, 2020a) do.…”

Section: Discussionmentioning

confidence: 99%

“…Knowledge Distillation: We further alleviate the noisy labeling problem of distance supervision via Knowledge Distillation (KD) as shown in Figure 1 (b). Following (Tian et al, 2020;Chen et al, 2020b), the teacher takes the context and response as input and generates the distribution of knowledge selection as soft target. Compared with the student, i.e., the standard knowledge selection module described in Section 2.4, teacher has the gold response as an additional input.…”

Section: Distilled Distant Supervision Lossmentioning

confidence: 99%

“…It is a sport or leisure activity in which a player rolls a bowling ball towards a target. Kim et al, 2020a;Chen et al, 2020b;. In this paper, we focus on knowledge selection in the unsupervised setting where there is no gold knowledge label.…”

Section: Uksdg Pfmentioning

confidence: 99%

See 2 more Smart Citations

Unsupervised Knowledge Selection for Dialogue Generation

Chen¹,

Chen²,

Meng³

et al. 2021

Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Self Cite

View full text Add to dashboard Cite

Knowledge selection is an important and challenging task which could provide the appropriate knowledge for informative dialogue generation. However, the needed gold knowledge label is difficult to collect in reality. In this paper, we study knowledge selection for dialogue generation in the unsupervised scenario and propose a novel Distilled Distant Supervision Loss (DDSL) to supervise knowledge selection when the gold knowledge label is unknown. Specifically, we first obtain an oracle knowledge label via distant supervision and then leverage knowledge distillation to alleviate the noisy labeling problem of distant supervision. Furthermore, we propose a pretraining-finetuning strategy to deal with the mismatch knowledge selection problem that models tend to select the mismatched knowledge for dialogue generation in the unsupervised setting and will cause the degeneration of knowledge-aware decoder. Experiments on two knowledge-grounded dialogue datasets show that our approach manages to select knowledge more accurately in the unsupervised setting and generates more informative responses, even outperforming many strong supervised baselines. 1

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Distilled Distant Supervision Lossmentioning

confidence: 99%

See 1 more Smart Citation

Unsupervised Knowledge Selection for Dialogue Generation

Chen¹,

Chen²,

Meng³

et al. 2021

Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Self Cite

View full text Add to dashboard Cite

show abstract

“…In addition to the works to enrich the contents of open-domain conversations by controllable generation (Lin et al, 2020;Madotto et al, 2020b), the knowledge grounded dialogue task aims to offer more informative conversation by leveraging an external knowledge source (Dinan et al, 2018;. Relevant knowledge selection is the key to improving the whole system, and very recently, latent variable models have been attracting more attention for this purpose (Lian et al, 2019;Liu et al, 2019b;Kim et al, 2020;Chen et al, 2020;Xu et al, 2021).…”

Section: Related Workmentioning

confidence: 99%

CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue System

Xu¹,

Ishii²,

Winata³

et al. 2021

Proceedings of the 1st Workshop on Document-Grounded Dialogue and Conversational Question Answering (DialDoc 2021)

View full text Add to dashboard Cite

Information-seeking dialogue systems, including knowledge identification and response generation, aim to respond to users with fluent, coherent, and informative responses based on users' needs, which. To tackle this challenge, we utilize data augmentation methods and several training techniques with the pre-trained language models to learn a general pattern of the task and thus achieve promising performance. In DialDoc21 competition, our system achieved 74.95 F1 score and 60.74 Exact Match score in subtask 1, and 37.72 Sacre-BLEU score in subtask 2. Empirical analysis is provided to explain the effectiveness of our approaches.

show abstract

“…Recently, there is increasing interest in visionlanguage tasks, such as image caption Anderson et al, 2016Anderson et al, , 2018Cornia et al, 2020) and visual question answering (Ren et al, 2015a;Gao et al, 2015;Lu et al, 2016;Anderson et al, 2018). In the real world, our conversations (Chen et al, 2020b(Chen et al, , 2019 usually have multiple turns. As an extension of conventional single-turn visual question answering, Das et al (2017) introduce a multi-turn visual question answering task named visual dialogue, which aims to Q1: how many people ?…”

Section: Introductionmentioning

confidence: 99%

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Chen¹,

Meng²,

Chen³

et al. 2021

Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Self Cite

View full text Add to dashboard Cite

Visual dialogue is a challenging task since it needs to answer a series of coherent questions on the basis of understanding the visual environment. Previous studies focus on the implicit exploration of multimodal coreference by implicitly attending to spatial image features or object-level image features but neglect the importance of locating the objects explicitly in the visual content, which is associated with entities in the textual content. Therefore, in this paper we propose a Multimodal Incremental Transformer with Visual Grounding, named MITVG, which consists of two key parts: visual grounding and multimodal incremental transformer. Visual grounding aims to explicitly locate related objects in the image guided by textual entities, which helps the model exclude the visual content that does not need attention. On the basis of visual grounding, the multimodal incremental transformer encodes the multi-turn dialogue history combined with visual scene step by step according to the order of the dialogue and then generates a contextually and visually coherent response. Experimental results on the VisDial v0.9 and v1.0 datasets demonstrate the superiority of the proposed model, which achieves comparable performance.

show abstract

Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation

Cited by 66 publications

References 33 publications

Unsupervised Knowledge Selection for Dialogue Generation

Unsupervised Knowledge Selection for Dialogue Generation

CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue System

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Contact Info

Product

Resources

About