2020
DOI: 10.1007/978-3-030-61362-4_16
|View full text |Cite
|
Sign up to set email alerts
|

Shield Synthesis for Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
16
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 38 publications
(17 citation statements)
references
References 14 publications
1
16
0
Order By: Relevance
“…Several extensions exist [4,6,29,39]. The shielding approach has been shown to be successful in combination with RL [2,21]. Jansen et al [19] introduced offline shielding with respect to probabilistic safety.…”
Section: Related Workmentioning
confidence: 99%
“…Several extensions exist [4,6,29,39]. The shielding approach has been shown to be successful in combination with RL [2,21]. Jansen et al [19] introduced offline shielding with respect to probabilistic safety.…”
Section: Related Workmentioning
confidence: 99%
“…These approaches leverage SMT solvers (e.g., [19]), based on LP and MILP solvers (e.g., [6], [10], [35], [42]), the propagation of symbolic intervals and abstract interpretation (e.g., [13], [43]- [45]), abstraction-refinement techniques (e.g., [3], [11]), and many others. Recent work has extended beyond answering yes/no questions about DNNs, targeting tasks such as automated DNN repair [15], [30] and quantitative verification [4]. Verification approaches have also been proposed for recurrent networks [23], [47], which could potentially also be simplified.…”
Section: Related Workmentioning
confidence: 99%
“…In addition to general DNN verification engines, methods have been devised to formally verify safety properties of DRL systems, which are the subject matter of this work. Such approaches include shield synthesis [31], and combining the verification process with verified runtime monitoring [16]. Other methods focus on finding adversarial attacks that pertain specifically to DRL agents, e.g., by using MILP [11].…”
Section: Related Workmentioning
confidence: 99%