2022
DOI: 10.48550/arxiv.2205.12331
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Certified Robustness Against Natural Language Attacks by Causal Intervention

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…Some include video moment retrieval [3], visual semantic segmentation [4], and visual dialog [5]. For NLP, causal inference has attracted myriad attention as a method to interpret adversarial attacks [6] and eradicate spurious confounding factors in SGD optimizer [7].…”
Section: A Causal Intervention For Deep Learningmentioning
confidence: 99%
“…Some include video moment retrieval [3], visual semantic segmentation [4], and visual dialog [5]. For NLP, causal inference has attracted myriad attention as a method to interpret adversarial attacks [6] and eradicate spurious confounding factors in SGD optimizer [7].…”
Section: A Causal Intervention For Deep Learningmentioning
confidence: 99%
“…Zhao et al [307] take a causal perspective on the natural language attack problem and frame the source of adversarial vulnerability as the spurious association induced by confounders. Fig.…”
Section: Certified Robustness Against Natural Language Attacksmentioning
confidence: 99%
“…To defend models against such attacks, Zhao et al [307] show that a Gaussianbased randomized classifier models the interventional distribution p(y | do(x)) and is therefore robust against l 2 -bounded attacks. However, textual input spaces are not continuous and text substitutions do not follow Gaussian distributions.…”
Section: Counterfactual Explanationsmentioning
confidence: 99%
“…First-order approximation [2], l 2 -ball [3], [53] [50], and axis-aligned bound [4] and [52] are popular ways to model perturbations. Axisaligned bound can also be combined with randomized smoothing techniques [74]. Our method is different from previous methods as we use convex hulls to model the attack space and capture the geometry of word substitutions more precisely.…”
Section: Related Workmentioning
confidence: 99%