2020
DOI: 10.48550/arxiv.2006.15009
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Unifying Framework for Reinforcement Learning and Planning

Abstract: Sequential decision making, commonly formalized as Markov Decision Process optimization, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are planning and reinforcement learning. Both research fields largely have their own research communities. However, if both research fields solve the same problem, then we should be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying framework for reinforcement learning and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
14
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(14 citation statements)
references
References 61 publications
0
14
0
Order By: Relevance
“…The third crucial consideration is: how to actually plan? Of course, we do not aim to provide a full survey of planning methods here, but refer the reader to Moerland et al (2020a) for a recent framework to categorize planning and RL methods. Instead, we focus on some crucial decisions we have to make for the integration, on a) the use of potential differentiability of the model, b) the direction of planning, c) the breadth and depth of the plan, and d) the way of dealing with uncertainty.…”
Section: How To Plan?mentioning
confidence: 99%
See 4 more Smart Citations
“…The third crucial consideration is: how to actually plan? Of course, we do not aim to provide a full survey of planning methods here, but refer the reader to Moerland et al (2020a) for a recent framework to categorize planning and RL methods. Instead, we focus on some crucial decisions we have to make for the integration, on a) the use of potential differentiability of the model, b) the direction of planning, c) the breadth and depth of the plan, and d) the way of dealing with uncertainty.…”
Section: How To Plan?mentioning
confidence: 99%
“…Hester and Stone (2012b) gives a book-chapter presentation of model-based RL methods, but their work does not provide a full overview, nor does it incorporate the vast recent literature on neural network approximation in model-based reinforcement learning. Moerland et al (2020a) presents a framework for reinforcement learning and planning that disentangles their common underlying dimensions, but does not focus on their integration. In some sense, Moerland et al (2020a) looks 'inside' each planning or reinforcement learning cycle, strapping their shared algorithmic space down into its underlying dimensions.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations