2020
DOI: 10.48550/arxiv.2008.11895
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multi-task Reinforcement Learning in Reproducing Kernel Hilbert Spaces via Cross-learning

Juan Cervino,
Juan Andres Bazerque,
Miguel Calvo-Fullana
et al.

Abstract: Reinforcement learning (RL) is a framework to optimize a control policy using rewards that are revealed by the system as a response to a control action. In its standard form, RL involves a single agent that uses its policy to accomplish a specific task. These methods require large amounts of reward samples to achieve good performance, and may not generalize well when the task is modified, even if the new task is related. In this paper we are interested in a collaborative scheme in which multiple agents with di… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 14 publications
0
2
0
Order By: Relevance
“…However, recent results have shown that even in those cases, problems akin to (PCL) have tractable duality gaps [14], motivating primal-dual approaches. Nonetheless, for the specific case of the cross-learning Algorithm 1, previous results have shown its converge in high probability to a neighborhood of a first-order stationary point of problem (PCL) in the context of reinforcement learning [11].…”
Section: Algorithm Constructionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, recent results have shown that even in those cases, problems akin to (PCL) have tractable duality gaps [14], motivating primal-dual approaches. Nonetheless, for the specific case of the cross-learning Algorithm 1, previous results have shown its converge in high probability to a neighborhood of a first-order stationary point of problem (PCL) in the context of reinforcement learning [11].…”
Section: Algorithm Constructionmentioning
confidence: 99%
“…In this paper, we take a constrained approach to the multi-task learning problem. Our formulation is based on the cross-learning framework [10,11]. Originally used for learning policies in a reinforcement learning scenario, nevertheless, its principles can be also applied to the multi-task supervised learning problem.…”
Section: Introductionmentioning
confidence: 99%