Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering 2020
DOI: 10.1145/3377811.3380400
|View full text |Cite
|
Sign up to set email alerts
|

Testing DNN image classifiers for confusion & bias errors

Abstract: Image classifiers are an important component of today's software, from consumer and business applications to safety-critical domains. The advent of Deep Neural Networks (DNNs) is the key catalyst behind such widespread success. However, wide adoption comes with serious concerns about the robustness of software systems dependent on DNNs for image classification, as several severe erroneous behaviors have been reported under sensitive and critical circumstances. We argue that developers need to rigorously test t… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 34 publications
(31 citation statements)
references
References 48 publications
0
31
0
Order By: Relevance
“…We further manually classified these 37 papers into three categories: autonomous driving testing [7,8,[26][27][28][29][30][31][32][33] contains papers related to testing and validation of ADSs. deep learning testing [9,[34][35][36][37][38][39][40][41][42][43][44][45][46][47][48][49][50][51][52][53] includes papers about testing methods and criteria for DL-based systems. deep-learning debugging and repair [54][55][56][57][58][59] includes papers referring to the deployment and bug fixing of DL systems.…”
Section: Literature Review Methodologymentioning
confidence: 99%
“…We further manually classified these 37 papers into three categories: autonomous driving testing [7,8,[26][27][28][29][30][31][32][33] contains papers related to testing and validation of ADSs. deep learning testing [9,[34][35][36][37][38][39][40][41][42][43][44][45][46][47][48][49][50][51][52][53] includes papers about testing methods and criteria for DL-based systems. deep-learning debugging and repair [54][55][56][57][58][59] includes papers referring to the deployment and bug fixing of DL systems.…”
Section: Literature Review Methodologymentioning
confidence: 99%
“…Zhang et al [78] and Aggarwal et al [2] attempted to test model fairness. Tian et al [69] are focused on testing the confusion and bias errors in DNNs. In this work, DNN testing is used for model reuse detection.…”
Section: Test Input Generation For Dnnmentioning
confidence: 99%
“…Fairness Testing: Recent approaches on fairness testing [2], [12], [32], [33], [40] are not directly applicable for fairness testing of NLP software. These approaches are mostly focused on the (causal) fairness testing of credit rating or computer vision systems.…”
Section: Related Workmentioning
confidence: 99%