2021
DOI: 10.1007/s10664-021-09982-4

Can Offline Testing of Deep Neural Networks Replace Their Online Testing?

Abstract: We distinguish two general modes of testing for Deep Neural Networks (DNNs): Offline testing where DNNs are tested as individual units based on test datasets obtained without involving the DNNs under test, and online testing where DNNs are embedded into a specific application environment and tested in a closed-loop mode in interaction with the application environment. Typically, DNNs are subjected to both types of testing during their development life cycle where offline testing is applied immediately after DN…
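As an illustrative sketch only (not taken from the paper), the distinction between the two testing modes can be summarized in Python: offline testing evaluates the DNN in isolation against a fixed dataset, while online testing embeds it in a closed loop with its application environment. The names model, test_dataset, environment, and safety_violated are hypothetical placeholders, not artifacts from the study.

def offline_test(model, test_dataset):
    # Offline testing: the DNN is treated as an isolated unit and scored
    # against a fixed test dataset collected without the DNN in the loop.
    errors = 0
    for inputs, expected in test_dataset:
        if model.predict(inputs) != expected:
            errors += 1
    return errors / len(test_dataset)  # e.g., a prediction error rate

def online_test(model, environment, max_steps=1000):
    # Online testing: the DNN drives the system inside a simulated
    # environment, and the environment reacts to the DNN's outputs.
    observation = environment.reset()
    for _ in range(max_steps):
        action = model.predict(observation)
        observation, done = environment.step(action)
        if environment.safety_violated():  # e.g., collision or lane departure
            return False
        if done:
            break
    return True

In this framing, offline metrics summarize per-input prediction quality, whereas the online loop surfaces system-level failures that only emerge from the interaction over time.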

Cited by 27 publications (22 citation statements)
References 26 publications

“…If any of the objects within the red bounding boxes were on a collision course with the ego car, commencing PAEB would indeed be the right action for SMIRK and thus not violate SYS-SAF-REQ1. This observation corroborates the position by (Haq et al, 2021), i.e., system level testing that goes beyond model testing on single frames is critically needed. All results from running ML model testing, i.e., ML Verification Results [Z], are documented in the Protocols folder.…”
Section: Model Testing [Aa] (supporting)
confidence: 86%
“…Second, SMIRK could be used as a realistic test benchmark for automotive ML testing. The testing community has largely worked on offline testing of single frames, but we know that this is insufficient (Haq et al, 2021). Third, we recommend the community to port SMIRK to other simulators beyond ESI Pro-SiVIC.…”
Section: Discussion (mentioning)
confidence: 99%
“…If False, the Rule Engine performs a sanity check based on laws of physics. (9) If UM remains confident that collision with a pedestrian is imminent, the signal to perform PAEB propagates to ego car.…”
Section: SMIRK Architecture (mentioning)
confidence: 99%
“…SMIRK allows researchers to explore data testing, ML model testing, integration testing, and system testing since data sets, ML model architectures, and the source code are publicly available. For example, offline model testing can be compared to online system testing, as Haq et al recently proved important [9]. Concrete test techniques that could be evaluated using SMIRK include search-based software testing, metamorphic testing, fuzz testing, neural network test adequacy assessments, and testing for explainable AI.…”
Section: Impact Overview (mentioning)
confidence: 99%