Assessing the quality of Deep Learning (DL) systems is crucial, as they are increasingly adopted in safety-critical domains. Researchers have proposed several input generation techniques for DL systems. While such techniques can expose failures, they do not explain which features of the test inputs influenced the system’s (mis-)behaviour.
DeepHyperion was the first test generator to overcome this limitation by exploring the DL systems’ feature space at large. In this paper, we propose DeepHyperion-CS, a test generator for DL systems that enhances DeepHyperion by promoting the inputs that contributed most to feature space exploration during the previous search iterations. We performed an empirical study involving two different test subjects (i.e., a digit classifier and a lane-keeping system for self-driving cars). Our results show that the contribution-based guidance implemented within DeepHyperion-CS outperforms state-of-the-art tools and significantly improves both the efficiency and the effectiveness of DeepHyperion.
DeepHyperion-CS exposed significantly more misbehaviours for 5 out of 6 feature combinations and was up to 65% more efficient than DeepHyperion in finding misbehaviour-inducing inputs and exploring the feature space.
DeepHyperion-CS was also useful for expanding the datasets used to train the DL systems, populating up to 200% more feature map cells than the original training set.
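
To make the idea of contribution-based guidance more concrete, the following Python sketch shows one possible way to bias parent selection in a MAP-Elites-style feature-map search toward inputs whose offspring previously filled or improved map cells. This is an illustrative assumption, not the authors' implementation; the names Individual, contribution_guided_search, mutate, features, and fitness are hypothetical placeholders.

    import random

    class Individual:
        """Wraps a test input together with its selection weight."""
        def __init__(self, data):
            self.data = data
            self.contribution = 1.0  # grows when this input's offspring expand the map

    def contribution_guided_search(seeds, mutate, features, fitness, iterations=1000):
        """MAP-Elites-style loop with contribution-biased parent selection."""
        archive = {}  # feature-map cell (tuple of feature values) -> elite Individual

        def try_place(ind):
            # Store the individual if its cell is empty or it beats the current elite.
            cell = features(ind.data)
            if cell not in archive or fitness(ind.data) > fitness(archive[cell].data):
                archive[cell] = ind
                return True
            return False

        for s in seeds:
            try_place(Individual(s))

        for _ in range(iterations):
            elites = list(archive.values())
            # Parents that recently contributed to exploration are picked more often.
            parent = random.choices(elites, weights=[e.contribution for e in elites], k=1)[0]
            child = Individual(mutate(parent.data))
            if try_place(child):
                parent.contribution += 1.0  # reward the parent for expanding or improving the map
        return archive

In this sketch the contribution weight simply counts how often a parent's offspring placed new or improved elites; the actual scoring and selection operators used by DeepHyperion-CS may differ.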