2022
DOI: 10.1609/icaps.v32i1.19820
|View full text |Cite
|
Sign up to set email alerts
|

Debugging a Policy: Automatic Action-Policy Testing in AI Planning

Abstract: Testing is a promising way to gain trust in neural action policies π. Previous work on policy testing in sequential decision making targeted environment behavior leading to failure conditions. But if the failure is unavoidable given that behavior, then π is not actually to blame. For a situation to qualify as a "bug" in π, there must be an alternative policy π' that does better. We introduce a generic policy testing framework based on that intuition. This raises the bug confirmation problem, deciding whether o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…For our experiments, we extend the testing framework by Eisenhut et al (2023), which builds on the one by Steinmetz et al (2022). As in this previous work, we test AS-Net policies (Toyer et al 2018(Toyer et al , 2020.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…For our experiments, we extend the testing framework by Eisenhut et al (2023), which builds on the one by Steinmetz et al (2022). As in this previous work, we test AS-Net policies (Toyer et al 2018(Toyer et al , 2020.…”
Section: Methodsmentioning
confidence: 99%
“…Recent work (Steinmetz et al 2022) addresses this in the context of classical planning, defining test cases as states s, and policy bugs as states on which the policy is sub-optimal (a better policy exists for s, e.g., avoiding failure). Given a test case s, test oracles are used to detect whether s is a bug, by evaluating sufficient criteria that avoid the need for a full optimal planning process (Eisenhut et al 2023).…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations