Machine Learning with Electronic Health Records is vulnerable to Backdoor Trigger Attacks

Joe, Byunggill; Mehra, Aseem; Shin, Insik; Hamm, Jihun

doi:10.48550/arxiv.2106.07925

Cited by 2 publications

(10 citation statements)

References 11 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the “Attack Efficacy” section of this paper, we describe 2 experiments that investigated “random poisoning” and “target poisoning.” To assess the stealthiness of the attack, we experimented with the visual similarity between the trigger data and the clean data (described in the “Stealthiness” section) and the impact of an attack on general classification performance (“Impact on Classification Performance” section). We also compare performance with an existing technique [ 19 ] in the “Comparative Performance” section.…”

Section: Resultsmentioning

confidence: 99%

“…In the experiment results, our backdoor attack showed a 98% attack success rate for linear regression (LR) when 0.4% of the training data set was poisoned with trigger data. Considering that the previous approach [ 19 ] required 3% data poisoning to achieve the same success rate, our attack shows significant performance improvements. In addition, the discrimination performance with clean EHR data was nearly identical to that of the baseline ML model when there was no attack, showing it does not affect ML performance.…”

Section: Introductionmentioning

confidence: 99%

“…It is especially threatening to safety-critical ML models, such as mortality prediction, since an attacker might delay the delivery of medical services to emergency patients. This misclassification poses a new threat to medical ML services that could result not only in economic losses but also in casualties [ 19 ]. Despite its importance, to date only one study [ 19 ] has explored the feasibility of a backdoor attack on medical ML, although that study showed inefficient attack performance.…”

Section: Introductionmentioning

confidence: 99%

“…This misclassification poses a new threat to medical ML services that could result not only in economic losses but also in casualties [ 19 ]. Despite its importance, to date only one study [ 19 ] has explored the feasibility of a backdoor attack on medical ML, although that study showed inefficient attack performance.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Exploiting Missing Value Patterns for a Backdoor Attack on Machine Learning Models of Electronic Health Records: Development and Validation Study

Joe¹,

Park²,

Hamm³

et al. 2022

JMIR Med Inform

Self Cite

View full text Add to dashboard Cite

Background A backdoor attack controls the output of a machine learning model in 2 stages. First, the attacker poisons the training data set, introducing a back door into the victim’s trained model. Second, during test time, the attacker adds an imperceptible pattern called a trigger to the input values, which forces the victim’s model to output the attacker’s intended values instead of true predictions or decisions. While backdoor attacks pose a serious threat to the reliability of machine learning–based medical diagnostics, existing backdoor attacks that directly change the input values are detectable relatively easily. Objective The goal of this study was to propose and study a robust backdoor attack on mortality-prediction machine learning models that use electronic health records. We showed that our backdoor attack grants attackers full control over classification outcomes for safety-critical tasks such as mortality prediction, highlighting the importance of undertaking safe artificial intelligence research in the medical field. Methods We present a trigger generation method based on missing patterns in electronic health record data. Compared to existing approaches, which introduce noise into the medical record, the proposed backdoor attack makes it simple to construct backdoor triggers without prior knowledge. To effectively avoid detection by manual inspectors, we employ variational autoencoders to learn the missing patterns in normal electronic health record data and produce trigger data that appears similar to this data. Results We experimented with the proposed backdoor attack on 4 machine learning models (linear regression, multilayer perceptron, long short-term memory, and gated recurrent units) that predict in-hospital mortality using a public electronic health record data set. The results showed that the proposed technique achieved a significant drop in the victim’s discrimination performance (reducing the area under the precision-recall curve by at most 0.45), with a low poisoning rate (2%) in the training data set. In addition, the impact of the attack on general classification performance was negligible (it reduced the area under the precision-recall curve by an average of 0.01025), which makes it difficult to detect the presence of poison. Conclusions To the best of our knowledge, this is the first study to propose a backdoor attack that uses missing information from tabular data as a trigger. Through extensive experiments, we demonstrated that our backdoor attack can inflict severe damage on medical machine learning classifiers in practice.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Exploiting Missing Value Patterns for a Backdoor Attack on Machine Learning Models of Electronic Health Records: Development and Validation Study

Joe¹,

Park²,

Hamm³

et al. 2022

JMIR Med Inform

Self Cite

View full text Add to dashboard Cite

show abstract

“…Since then, there have been several attacks and defenses in neural networks with attacks focusing on stealth and undetectability and defenses focusing on generalization of detection across datasets and applications [14,26]. The backdoor attack literature primarily focuses on DNNs, specifically because of the black-box nature of DNNs which deters the development of a generic defense, with very few focusing on smaller models [20,45]. The triggers are designed from the perspective of the input, rather than the model, so that they remain hidden (inconspicuous) from the user.…”

Section: Related Workmentioning

confidence: 99%

TRAPDOOR: Repurposing backdoors to detect dataset bias in machine learning-based genomic analysis

Sarkar¹,

Maniatakos²

2021

Preprint

View full text Add to dashboard Cite

Machine Learning (ML) has achieved unprecedented performance in several applications including image, speech, text, and data analysis. Use of ML to understand underlying patterns in gene mutations (genomics) has far-reaching results, not only in overcoming diagnostic pitfalls, but also in designing treatments for life-threatening diseases like cancer. Success and sustainability of ML algorithms depends on the quality and diversity of data collected and used for training. Under-representation of groups (ethnic groups, gender groups, etc.) in such a dataset can lead to inaccurate predictions for certain groups, which can further exacerbate systemic discrimination issues.In this work, we propose TRAPDOOR, a methodology for identification of biased datasets by repurposing a technique that has been mostly proposed for nefarious purposes: Neural network backdoors. We consider a typical collaborative learning setting of the genomics supply chain, where data may come from hospitals, collaborative projects, or research institutes to a central cloud without awareness of bias against a sensitive group. In this context, we develop a methodology to leak potential bias information of the collective data without hampering the genuine performance using ML backdooring catered for genomic applications. Using a real-world cancer dataset, we analyze the dataset with the bias that already existed towards white individuals and also introduced biases in datasets artificially, and our experimental result show that TRAP-DOOR can detect the presence of dataset bias with 100% accuracy, and furthermore can also extract the extent of bias by recovering the percentage with a small error. CCS CONCEPTS• Security and privacy → Systems security.

show abstract

Machine Learning with Electronic Health Records is vulnerable to Backdoor Trigger Attacks

Cited by 2 publications

References 11 publications

Exploiting Missing Value Patterns for a Backdoor Attack on Machine Learning Models of Electronic Health Records: Development and Validation Study

Exploiting Missing Value Patterns for a Backdoor Attack on Machine Learning Models of Electronic Health Records: Development and Validation Study

TRAPDOOR: Repurposing backdoors to detect dataset bias in machine learning-based genomic analysis

Contact Info

Product

Resources

About