Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen 2019
DOI: 10.18653/v1/d19-1292
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating adversarial attacks against multiple fact verification systems

Abstract: Automated fact verification has been progressing owing to advancements in modeling and availability of large datasets. Due to the nature of the task, it is critical to understand the vulnerabilities of these systems against adversarial instances designed to make them predict incorrectly. We introduce two novel scoring metrics, attack potency and system resilience which take into account the correctness of the adversarial instances, an aspect often ignored in adversarial evaluations. We consider six fact verifi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
28
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 31 publications
(28 citation statements)
references
References 22 publications
0
28
0
Order By: Relevance
“…While much recent work in adversarial attacks aims to break NLI systems and is especially adapted to this problem [13,29], these stress tests have been applied to several other tasks, e.g. Question-Answering [49], Machine Translation [4], or Fact Checking [1,44]. Unfortunately, preserving the semantics of a sentence while automatically generating these adversarial attacks is difficult, which is why some works have defined small stress tests manually [19,27].…”
Section: Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…While much recent work in adversarial attacks aims to break NLI systems and is especially adapted to this problem [13,29], these stress tests have been applied to several other tasks, e.g. Question-Answering [49], Machine Translation [4], or Fact Checking [1,44]. Unfortunately, preserving the semantics of a sentence while automatically generating these adversarial attacks is difficult, which is why some works have defined small stress tests manually [19,27].…”
Section: Datasetmentioning
confidence: 99%
“…To measure the effectiveness of each adversarial attack a ∈ A , we calculate the potency score introduced by Thorne et al [44] as the average reduction from a perfect score and across the systems s ∈ S: with c a representing the transformation correctness from test to adversarial samples and a function f that returns the performance score for a system s on an adversarial attack set a.…”
Section: Metrics (Original)mentioning
confidence: 99%
See 1 more Smart Citation
“…The submissions were scored using 'potency' and 'resilience' (Thorne et al, 2019) that compute a weighted average of FEVER scores: accounting for the correctness of adversarial instances.…”
Section: Scoring Methodsmentioning
confidence: 99%
“…These rule-based adversarial attacks are more common in the written vernacular and therefore are often found in misinformation posts on social media. [3].…”
Section: Evidence-based Overreliance -mentioning
confidence: 99%