2022
DOI: 10.48550/arxiv.2205.08989
Preprint

Constraining the Attack Space of Machine Learning Models with Distribution Clamping Preprocessing

Abstract: Preprocessing and outlier detection techniques have both been applied to neural networks to increase robustness with varying degrees of success. In this paper, we formalize the ideal preprocessor function as one that would take any input and set it to the nearest in-distribution input. In other words, we detect any anomalous pixels and set them such that the new input is in-distribution. We then illustrate a relaxed solution to this problem in the context of patch attacks. Specifically, we demonstrate that we …
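The abstract is truncated above, so the sketch below only illustrates the stated idea of detecting anomalous pixels and setting them to the nearest in-distribution value; it is not the authors' relaxed solution for patch attacks. The per-pixel percentile bounds, the `fit_bounds` and `clamp_to_distribution` helpers, and the use of NumPy are assumptions made here for illustration.

```python
import numpy as np

def fit_bounds(train_images, lo_pct=1.0, hi_pct=99.0):
    """Estimate per-pixel in-distribution bounds from clean training images.

    train_images: array of shape (N, H, W, C) with values in [0, 1].
    Returns (lower, upper) arrays of shape (H, W, C).
    """
    lower = np.percentile(train_images, lo_pct, axis=0)
    upper = np.percentile(train_images, hi_pct, axis=0)
    return lower, upper

def clamp_to_distribution(x, lower, upper):
    """Project an input onto the estimated in-distribution box.

    Pixels outside [lower, upper] are treated as anomalous and are moved to
    the nearest value inside the bounds; in-distribution pixels are unchanged.
    """
    return np.clip(x, lower, upper)

# Usage sketch (hypothetical names):
# lower, upper = fit_bounds(train_images)
# x_clamped = clamp_to_distribution(x_possibly_adversarial, lower, upper)
# logits = model(x_clamped)
```

Under this per-pixel box model, clamping is exactly the projection onto the nearest in-distribution point, which is one simple way to read the "ideal preprocessor" described in the abstract.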

Cited by 1 publication (3 citation statements)
References 23 publications

“…One category of input-based defenses is digital defenses, which use various preprocessing functions to protect against evasion attacks that either bypass multi-sensor physical defenses or are executed through a public-facing API. These defenses can fall under the following categories: detect and remove the attack vector [28-30]; implement non-differentiable functions to obscure gradients [31, 32]; sanitize the attack vector to eliminate adversarial perturbations [33-35]; and apply formal verification [36-38] or certification techniques [39-41] to provide performance guarantees. Defenses applied to training data protect against poisoning attacks by filtering out potentially poisoned data samples [42-46].…”
Section: Defense Preparation
confidence: 99%
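As a concrete illustration of one category named in the citing passage ("sanitize the attack vector to eliminate adversarial perturbations"), a minimal input-sanitization sketch follows. The choice of a median filter, the `scipy.ndimage.median_filter` call, and the filter size are assumptions for illustration, not any specific cited defense.

```python
import numpy as np
from scipy.ndimage import median_filter

def sanitize_input(x, size=3):
    """Smooth out high-frequency adversarial perturbations with a median filter.

    x: image array of shape (H, W, C) with values in [0, 1].
    Filtering is applied per channel so color channels are not mixed.
    """
    channels = [median_filter(x[..., c], size=size) for c in range(x.shape[-1])]
    return np.stack(channels, axis=-1)

# Usage sketch: the sanitized input replaces the raw (possibly adversarial)
# input before it reaches the model.
# x_clean = sanitize_input(x_adv)
# logits = model(x_clean)
```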
“…Model input | Adversarial detection and removal [28-30], input sanitization [33-35], gradient obfuscation [31, 32], provable defenses [36-41] | Remove adversarial perturbation, increase attack cost, or provide performance guarantees…”
Section: Defense Location | Options | Description
confidence: 99%