CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning

Benjamins, Carolin; Eimer, Theresa; Schubert, Frederik; Biedenkapp, André; Rosenhahn, Bodo; Hutter, Frank; Lindauer, Marius

doi:10.48550/arxiv.2110.02102

Cited by 2 publications

(2 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Furthermore, we present additional experiments in Appendix suggesting that our model can potentially generalize to non-Markovian and state-dependent settings. While we presented several experiments in various environments, further experimental evaluation is required, e.g., using Benjamins et al (2021).…”

Section: Conclusion and Discussionmentioning

confidence: 99%

Reinforcement Learning in Presence of Discrete Markovian Context Evolution

Ren¹,

Sootla²,

Jafferjee³

et al. 2022

Preprint

View full text Add to dashboard Cite

We consider a context-dependent Reinforcement Learning (RL) setting, which is characterized by: a) an unknown finite number of not directly observable contexts; b) abrupt (discontinuous) context changes occurring during an episode; and c) Markovian context evolution. We argue that this challenging case is often met in applications and we tackle it using a Bayesian approach and variational inference. We adapt a sticky Hierarchical Dirichlet Process (HDP) prior for model learning, which is arguably best-suited for Markov process modeling. We then derive a context distillation procedure, which identifies and removes spurious contexts in an unsupervised fashion. We argue that the combination of these two components allows to infer the number of contexts from data thus dealing with the context cardinality assumption. We then find the representation of the optimal policy enabling efficient policy learning using off-the-shelf RL algorithms. Finally, we demonstrate empirically (using gym environments cart-pole swing-up, drone, intersection) that our approach succeeds where state-of-the-art methods of other frameworks fail and elaborate on the reasons for such failures.

show abstract

Section: Conclusion and Discussionmentioning

confidence: 99%

Reinforcement Learning in Presence of Discrete Markovian Context Evolution

Ren¹,

Sootla²,

Jafferjee³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…In the literature, several methods exist that include explicit contextual information. For example, Benjamins et al [41], [42] introduced a framework designed to solve CMDPs and a benchmark library. The framework includes information such as gravity, target distance, actuator strength, and joint stiffness in the learning process.…”

Section: Explicit Context-based Methodsmentioning

confidence: 99%

Affordance-Based Human–Robot Interaction With Reinforcement Learning

et al. 2023

View full text Add to dashboard Cite

Planning precise manipulation in robotics to perform grasp and release-related operations, while interacting with humans is a challenging problem. Reinforcement learning (RL) has the potential to make robots attain this capability. In this paper, we propose an affordance-based human-robot interaction (HRI) framework, aiming to reduce the action space size that would considerably impede the exploration efficiency of the agent. The framework is based on a new algorithm called Contextual Q-learning (CQL). We first show that the proposed algorithm trains in a reduced amount of time (2.7 seconds) and reaches an 84% of success rate. This suits the robot's learning efficiency to observe the current scenario configuration and learn to solve it. Then, we empirically validate the framework for implementation in HRI real-world scenarios. During the HRI, the robot uses semantic information from the state and the optimal policy of the last training step to search for relevant changes in the environment that may trigger the generation of a new policy. INDEX TERMSQ-learning, Robotics, Affordances, Robot learning, Human-Robot Interaction [14]. Passive observation is when the robot learns from the user through video streams [15]. With teleoperation, the user

show abstract

CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning

Cited by 2 publications

References 17 publications

Reinforcement Learning in Presence of Discrete Markovian Context Evolution

Reinforcement Learning in Presence of Discrete Markovian Context Evolution

Affordance-Based Human–Robot Interaction With Reinforcement Learning

Contact Info

Product

Resources

About