2021
DOI: 10.48550/arxiv.2104.10219
Preprint

Scalable Synthesis of Verified Controllers in Deep Reinforcement Learning

Abstract: There has been significant recent interest in devising verification techniques for learning-enabled controllers (LECs) that manage safety-critical systems. Given the opacity and lack of interpretability of the neural policies that govern the behavior of such controllers, many existing approaches enforce safety properties through the use of shields, a dynamic monitoring and repair mechanism that ensures a LEC does not emit actions that would violate desired safety conditions. These methods, however, have shown …
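The shield mechanism described in the abstract — dynamically monitoring a learned policy's actions and repairing unsafe ones — can be sketched as a thin wrapper around the policy. The following is a minimal illustration only; the safety predicate, dynamics, and fallback action are hypothetical placeholders, not the construction from the paper:

```python
# Minimal sketch of a runtime shield for a learned controller.
# The safety condition and fallback policy are illustrative assumptions.

def is_safe(state, action):
    # Hypothetical safety condition: the action must keep the
    # 1-D successor state within the interval [-10, 10].
    return -10.0 <= state + action <= 10.0

def fallback(state):
    # Hypothetical verified fallback: steer back toward the origin.
    return -1.0 if state > 0 else 1.0

def shielded_step(state, proposed_action):
    """Monitor the learned policy's proposed action; repair it if unsafe."""
    if is_safe(state, proposed_action):
        return proposed_action   # safe: pass the action through unchanged
    return fallback(state)       # unsafe: override with the fallback action

# Usage: an action that would leave the safe region is repaired.
print(shielded_step(9.5, 2.0))   # unsafe proposal, repaired to -1.0
print(shielded_step(0.0, 2.0))   # safe proposal, passed through as 2.0
```

The shield never modifies the policy itself; it only intercepts actions at execution time, which is what makes it applicable to opaque neural policies.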

Cited by 4 publications (5 citation statements)
References 29 publications
“…Most research in safe DRL focuses on enhancing safety and robustness by reducing potentially unsafe actions, including methods for safe monitoring and adversarial training. For example, shielding methods [11], [12], [16] prevent the agent from taking unsafe actions in every state. Mandlekar et al. [17] used actively chosen adversarial perturbations for robust policy training to improve robustness (resistance to changes) in complex environments.…”
Section: Related Work
confidence: 99%
“…One is based on model transformation, which transforms the embedded DNN model into an interpretable model such as decision trees or programs [3,32]. Another is to synthesize barrier functions that assist the DNN in decision making, which can ensure safety during deployment [35,33]. The last is to incorporate the DNN into the system dynamics [14,31].…”
Section: Efficiency and Scalability
confidence: 99%
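The barrier-function approach mentioned in the statement above can be illustrated as a filter on actions: a barrier certificate is non-negative exactly on the safe set, and an action is accepted only if the successor state keeps the certificate non-negative. This is a simplified sketch under assumed 1-D dynamics, not the synthesis procedure from [35,33]:

```python
# Sketch of barrier-function-guided action filtering.
# The dynamics and barrier certificate are illustrative assumptions.

def dynamics(x, u):
    # Hypothetical 1-D linear system: x' = 0.9*x + u
    return 0.9 * x + u

def barrier(x):
    # Hypothetical barrier certificate: B(x) >= 0 on the safe set |x| <= 5.
    return 25.0 - x * x

def certified(x, u):
    """Accept an action only if the barrier stays non-negative afterward."""
    return barrier(dynamics(x, u)) >= 0.0

print(certified(4.0, 0.5))   # successor 4.1 is inside |x| <= 5 -> True
print(certified(4.0, 3.0))   # successor 6.6 is outside      -> False
```

In the synthesized setting, the barrier function is produced offline with a proof that this check implies safety for all reachable states, so the runtime cost is just one function evaluation per step.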
“…Instead of directly verifying DRL systems, most existing approaches rely on transforming them into verifiable models. Representative works include extracting decision trees [3] and programmatic policies [32], synthesizing deterministic programs [35] and linear controllers [33], and transforming into hybrid systems [14] and star sets [31]. Although these transformation-based approaches are effective solutions, they have some limitations, e.g., extracted policies may not equivalently represent the source neural networks, and the properties that can be verified may be limited.…”
Section: Introduction
confidence: 99%
“…A typical example is autonomous driving, which is arguably still a long way off due to safety concerns [21,39]. Recently, tremendous efforts have been made toward adapting existing and devising new formal methods for DRL systems in order to provide provable safety guarantees [18,25,45,46,51].…”
Section: Introduction
confidence: 99%