2021
DOI: 10.1609/aaai.v35i6.16638
Encoding Human Domain Knowledge to Warm Start Reinforcement Learning

Abstract: Deep reinforcement learning has been successful in a variety of tasks, such as game playing and robotic manipulation. However, attempting to learn tabula rasa disregards the logical structure of many domains as well as the wealth of readily available knowledge from domain experts that could help "warm start" the learning process. We present a novel reinforcement learning technique that allows for intelligent initialization of a neural network's weights and architecture. Our approach permits encoding of domain k…
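The abstract's idea of initializing a network from expert knowledge can be illustrated with a minimal sketch. This is not the paper's actual implementation; the rule, the feature layout, and the sharpness parameter below are all hypothetical. A propositional rule is encoded as a soft decision node whose parameters reproduce the rule at initialization yet remain differentiable for later fine-tuning.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical expert rule: "if feature 0 > 0.5, take action 0; else action 1".
# Encoded as a soft (differentiable) decision node so that gradient descent
# can still adjust the rule after this warm-started initialization.
w = np.array([1.0, 0.0])   # attend to feature 0 only
b = -0.5                   # threshold the rule at 0.5
alpha = 10.0               # sharpness: larger alpha makes the rule crisper

leaf_true = np.array([1.0, 0.0])   # action distribution if the rule fires
leaf_false = np.array([0.0, 1.0])  # action distribution otherwise

def policy(x):
    p = sigmoid(alpha * (w @ x + b))      # soft truth value of the rule
    return p * leaf_true + (1 - p) * leaf_false

print(policy(np.array([0.9, 0.0])))  # mass concentrated on action 0
print(policy(np.array([0.1, 0.0])))  # mass concentrated on action 1
```

At initialization the policy behaves like the crisp rule, but every parameter (weights, threshold, leaf distributions) remains trainable, which is the essence of a warm start.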

Cited by 21 publications (21 citation statements)
References 29 publications
“…However, it has been found that approaches that rely on visual assessment can sometimes be misleading, as they may be specific to unique data or modelling conditions, and can be highly susceptible to outlying outputs that contradict the explanation [33][34][35]. Prior work has also sought to transform uninterpretable deep networks into interpretable architectures or modalities such as decision trees [12,13,36] or Bayesian rule lists [14], and to generate explanations by exploiting the "white-box" nature of these architectures [37].…”
Section: Explainable AI Methodologies (mentioning)
confidence: 99%
“…Explanation Format - We chose XAI modalities that can all be generated from a single methodology presented in prior work [13]. This method converts learned policies into discretized decision trees, which elucidate the decision-making process within an AI agent's policy [37].…”
Section: Experiment Design (mentioning)
confidence: 99%
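The discretization step this excerpt describes can be sketched as follows. This is a hedged illustration rather than the cited method: a learned soft decision node (the weights `w` and bias `b` below are hypothetical) is reduced to a crisp, human-readable threshold rule on its dominant feature.

```python
import numpy as np

# Hypothetical soft decision node learned by a differentiable tree policy.
w = np.array([0.2, 1.7, -0.3])  # learned feature weights
b = -0.85                       # learned bias

# Discretize: keep only the dominant feature, yielding a crisp rule of the
# form "feature j > threshold". (If w[j] were negative, the comparison
# direction would flip; here w[j] is positive.)
j = int(np.argmax(np.abs(w)))   # index of the dominant feature
threshold = -b / w[j]           # solve w[j] * x[j] + b = 0 for x[j]
print(f"if x[{j}] > {threshold:.2f}: go left")  # → if x[1] > 0.50: go left
```

The resulting rule is readable at the cost of discarding the contribution of the non-dominant features, which is the usual fidelity/interpretability trade-off in this kind of extraction.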
“…Finally, Warm Start Reinforcement Learning (WSRL) (Cheng et al., 2018; Zhu & Liao, 2017) aims at initializing the policy of the agent with another policy pre-trained on the same task. Domain knowledge, i.e., information about the environment known by the designers but not initially known by the agent, can be used to kickstart learning, either through imitation learning on expert demonstrations (Cheng et al., 2018), by directly encoding it via propositional rules in the neural network architecture of the agent (Silva & Gombolay, 2021), or by actively learning to imitate a transferred policy (Wexler et al., 2022).…”
Section: Learning (mentioning)
confidence: 99%
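One warm-start route mentioned above, imitation learning on expert demonstrations, can be sketched with a toy behavior-cloning step. The demonstrations and the logistic policy below are illustrative assumptions, not any cited paper's setup: the policy is fit to expert data before any environment interaction, and RL would then fine-tune from these weights.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical expert demonstrations: states and the actions the expert took.
states = rng.normal(size=(200, 4))
expert_actions = (states[:, 0] + states[:, 1] > 0).astype(float)

# Behavior cloning: fit a logistic policy to the demonstrations with
# full-batch gradient descent on the cross-entropy loss.
w = np.zeros(4)
b = 0.0
lr = 0.5
for _ in range(300):
    p = 1.0 / (1.0 + np.exp(-(states @ w + b)))   # predicted P(action = 1)
    grad_w = states.T @ (p - expert_actions) / len(states)
    grad_b = np.mean(p - expert_actions)
    w -= lr * grad_w
    b -= lr * grad_b

acc = np.mean((p > 0.5) == expert_actions.astype(bool))
print(f"cloning accuracy on demonstrations: {acc:.2f}")
```

The cloned weights then serve as the agent's initial policy, so exploration starts near the expert's behavior instead of from random actions.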