2020
DOI: 10.48550/arxiv.2006.11130
Preprint

Systematic Attack Surface Reduction For Deployed Sentiment Analysis Models

Cited by 3 publications
(3 citation statements)
References 0 publications
“…Recent papers focus on securing models during the development process. Green team machine learning creates a process called "Build, Attack, Defend" to evaluate the machine learning models during the development process and begin protecting against red team style attacks on the models [9]. Attacks are increasingly sophisticated with their ability to detect the underlying model architecture and therefore, exploit vulnerabilities in these models.…”
Section: Securing Deployed Models Against Adversarial Attacks
Citation type: mentioning
confidence: 99%
“…Further, Kurita et al (2020) observed that in spite of rich sub-word representations, a BERT-based classifier can be deceived by inserting a specific rare word to an abusive sentence. Kalin et al (2020) proposed a structured approach for securing a toxicity detection classifier in a production setting.…”
Section: Current Technical and Ethical Challenges
Citation type: mentioning
confidence: 99%
“…As shown in Figure 3, Leet-speak uses an alternative alphabet of numbers and symbols to replace various letters in words [1][2][3]. As an innovative language strategy, Leet might be one of the first adversarial attacks on machine-driven filters [31].…”
Section: Introduction
Citation type: mentioning
confidence: 99%