2021 IEEE 26th Pacific Rim International Symposium on Dependable Computing (PRDC) 2021
DOI: 10.1109/prdc53464.2021.00012
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking Safety Monitors for Image Classifiers with Machine Learning

Abstract: High-accurate machine learning (ML) image classifiers cannot guarantee that they will not fail at operation. Thus, their deployment in safety-critical applications such as autonomous vehicles is still an open issue. The use of fault tolerance mechanisms such as safety monitors is a promising direction to keep the system in a safe state despite errors of the ML classifier. As the prediction from the ML is the core information directly impacting safety, many works are focusing on monitoring the ML model itself. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 24 publications
0
13
0
Order By: Relevance
“…A complete literature review about MLRM mechanisms is not the purpose of this paper, but we present a short taxonomy of existing techniques to illustrate the general philosophy inspired from [16]. A first family of approaches predicts misbehavior of a ML model by analyzing its inputs.…”
Section: B Runtime Monitoring Of MLmentioning
confidence: 99%
See 1 more Smart Citation
“…A complete literature review about MLRM mechanisms is not the purpose of this paper, but we present a short taxonomy of existing techniques to illustrate the general philosophy inspired from [16]. A first family of approaches predicts misbehavior of a ML model by analyzing its inputs.…”
Section: B Runtime Monitoring Of MLmentioning
confidence: 99%
“…For instance, the work presented in [17] uses a benchmarking methodology based on three datasets, one for training the model, one for tuning the monitor and one for evaluation. Likewise, a complete benchmarking framework, relying on artificial generation of OOD data, is presented in [16]. These approaches present the advantage of using different data sources to fit a monitor and to assess its performance.…”
Section: B Runtime Monitoring Of MLmentioning
confidence: 99%
“…In total, we produced 79 benchmark datasets. More details can be found in the results section of our repository [32].…”
Section: A Data Profilementioning
confidence: 99%
“…Examples include, but are not limited to: occlusions, shadows, defects of the camera lens, changes in environmental light, raindrops on the camera lens, out-offocus, flare [18], [19]. Therefore, to guarantee safety of the driving task, it is necessary to study the robustness of TSR systems against the aforementioned threats, and develop solutions to tolerate them [20], [21]. This paper accomplishes this tasks, first reviewing the challenges and the state of the art, and then proposing and evaluating alternative solutions.…”
Section: Introductionmentioning
confidence: 99%