Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021
DOI: 10.24963/ijcai.2021/684
|View full text |Cite
|
Sign up to set email alerts
|

Combining Reinforcement Learning and Causal Models for Robotics Applications

Abstract: The relation between Reinforcement learning (RL) and Causal Modeling(CM) is an underexplored area with untapped potential for any learning task. In this extended abstract of our Ph.D. research proposal, we present a way to combine both areas to improve their respective learning processes, especially in the context of our application area (service robotics). The preliminary results obtained so far are a good starting point for thinking about the success of our research project.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
19
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(19 citation statements)
references
References 2 publications
0
19
0
Order By: Relevance
“…The experimental results in the deterministic and stochastic versions of the test scenarios, show the advantages of CARL, not only over the previous work, but also compared to traditional model-free and modelbased algorithms. Fourth, the feasibility hypothesised in [11] of using the learned models for zero-shot transfer learning was verified using a more complex version of the Taxi task. Finally, the method's scalability to high-dimensional states, where the action-value function needs to be represented with deep neural networks, was verified.…”
Section: Introductionmentioning
confidence: 98%
See 4 more Smart Citations
“…The experimental results in the deterministic and stochastic versions of the test scenarios, show the advantages of CARL, not only over the previous work, but also compared to traditional model-free and modelbased algorithms. Fourth, the feasibility hypothesised in [11] of using the learned models for zero-shot transfer learning was verified using a more complex version of the Taxi task. Finally, the method's scalability to high-dimensional states, where the action-value function needs to be represented with deep neural networks, was verified.…”
Section: Introductionmentioning
confidence: 98%
“…Recently, in [11] the authors present a novel way to integrate reinforcement learning with causal discovery in the context of online MPD settings as the one described above, with an emphasis on reducing the policy learning time while inducing the underlying causal modes. In alternating stages, the proposed system learns a two-slice causal Dynamic Bayesian Network for each of the agent's actions and use those models to guide the action selection in the traditional RL episodes.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations