2020
DOI: 10.48550/arxiv.2005.14419
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reinforcement Learning

Olivier Buffet,
Olivier Pietquin,
Paul Weng

Abstract: Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decisionmaking problem where, at every time step, it observes its state, performs an action, receives a reward and moves to a new state. An RL agent learns by trial and error a good policy (or controller) based on observations and numeric reward feedback on the previously performed action. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 68 publications
0
1
0
Order By: Relevance
“…All of these challenges aim to achieve an optimal estimate based on the value of a state. There are policy-based and value-based methods [9] that can be applied to a model-free setting such as the financial market. [10].…”
Section: Introductionmentioning
confidence: 99%
“…All of these challenges aim to achieve an optimal estimate based on the value of a state. There are policy-based and value-based methods [9] that can be applied to a model-free setting such as the financial market. [10].…”
Section: Introductionmentioning
confidence: 99%