Proceedings of the 28th International Conference on Computational Linguistics 2020
DOI: 10.18653/v1/2020.coling-main.126
|View full text |Cite
|
Sign up to set email alerts
|

Contrastive Zero-Shot Learning for Cross-Domain Slot Filling with Adversarial Attack

Abstract: Zero-shot slot filling has widely arisen to cope with data scarcity in target domains. However, previous approaches often ignore constraints between slot value representation and related slot description representation in the latent space and lack enough model robustness. In this paper, we propose a Contrastive Zero-Shot Learning with Adversarial Attack (CZSL-Adv) method for the cross-domain slot filling. The contrastive loss aims to map slot value contextual representations to the corresponding slot descripti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 26 publications
(25 citation statements)
references
References 20 publications
0
25
0
Order By: Relevance
“…In the second stage, our model encodes the slot entity and predicts the label for it by calculating the similarity with the slot prototypes in the label semantic space. Unlike the previous works (Liu et al, 2020b;He et al, 2020c) which directly utilize the slot name embedding as slot prototypes, we introduce Prototypical Contrastive learning and Label Confusion strategies (PCLC) strategies to dynamically refine the constraint relationship between slot prototypes in the semantic space, as shown in Fig 2 . In the training procedure, we use an MLP layer to encode the original slot name embedding. So we can obtain a dynamically updated slot prototype matrix.…”
Section: Overall Architecturementioning
confidence: 99%
See 2 more Smart Citations
“…In the second stage, our model encodes the slot entity and predicts the label for it by calculating the similarity with the slot prototypes in the label semantic space. Unlike the previous works (Liu et al, 2020b;He et al, 2020c) which directly utilize the slot name embedding as slot prototypes, we introduce Prototypical Contrastive learning and Label Confusion strategies (PCLC) strategies to dynamically refine the constraint relationship between slot prototypes in the semantic space, as shown in Fig 2 . In the training procedure, we use an MLP layer to encode the original slot name embedding. So we can obtain a dynamically updated slot prototype matrix.…”
Section: Overall Architecturementioning
confidence: 99%
“…• Contrastive Zero-Shot Learning with Adversarial Attack (CZSL-Adv) A method proposed by (He et al, 2020c) based on Coach, which utilizes contrastive learning and adversarial attacks to improve the performance and robustness of the framework. Implementation Details We follow the setup of (Liu et al, 2020b), selecting one domain as the target domain at a time, and use 500 samples in this domain as a validation set, the rest as a test set.…”
Section: Setupmentioning
confidence: 99%
See 1 more Smart Citation
“…for pre-defined slot types, using character embedding (Liang et al, 2017a), copy mechanism (Zhao and Feng, 2018), few/zero-shot learning (Hu et al, 2019;He et al, 2020e;Shah et al, 2019), transfer learning (Chen and Moschitti, 2019;He et al, 2020c,b) and background knowledge (Yang and Mitchell, 2017;He et al, 2020d), etc. Compared to OOV recognition, our proposed novel slot detection task focuses on detecting unknown slot types, not just unseen values.…”
Section: Introductionmentioning
confidence: 99%
“…NSD aims to discover potential new or out-of-domain entity types to strengthen the capability of a dialogue system based on in-domain precollected training data. There are two aspects in the previous work related to NSD, out-of-vocabulary (OOV) recognition (Liang et al, 2017a;Zhao and Feng, 2018;Hu et al, 2019;He et al, 2020c,d;Yan et al, 2020;He et al, 2020e) and out-of-domain (OOD) intent detection (Lin and Xu, 2019;Larson et al, 2019;Xu et al, 2020a;Zeng et al, 2021b,a) 1: Comparison between slot filling and novel slot detection. In the novel slot detection labels, we consider "album" as an unknown slot type that is out of the scope of the pre-defined slot set.…”
Section: Introductionmentioning
confidence: 99%