DOI: 10.5821/dissertation-2117-95729
|View full text |Cite
|
Sign up to set email alerts
|

Regularized approximate policy iteration using kernel for on-line reinforcement learning

Gennaro Esposito

Abstract: By using Reinforcement Learning (RL), an autonomous agent interacting with the environment can learn how to take adequate actions for every situation in order to optimally achieve its own goal. RL provides a general methodology able to solve uncertain and complex decision problems which may be present in many real-world applications. RL problems are usually modeled as a Markov Decision Processes (MDPs) deeply studied in the literature. The main peculiarity of a RL algorithm is that the RL agent is assumed to l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 74 publications
(200 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?