Multi-task Reinforcement Learning in Reproducing Kernel Hilbert Spaces via Cross-learning

Cervino, Juan; Bazerque, Juan Andres; Calvo-Fullana, Miguel; Ribeiro, Alejandro

doi:10.48550/arxiv.2008.11895

Cited by 1 publication

(2 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, recent results have shown that even in those cases, problems akin to (PCL) have tractable duality gaps [14], motivating primal-dual approaches. Nonetheless, for the specific case of the cross-learning Algorithm 1, previous results have shown its converge in high probability to a neighborhood of a first-order stationary point of problem (PCL) in the context of reinforcement learning [11].…”

Section: Algorithm Constructionmentioning

confidence: 99%

See 1 more Smart Citation

Multi-task Supervised Learning via Cross-learning

Juan¹,

Bazerque²,

Calvo-Fullana³

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

In this paper we consider a problem known as multi-task learning, consisting of fitting a set of classifier or regression functions intended for solving different tasks. In our novel formulation, we couple the parameters of these functions, so that they learn in their task specific domains while staying close to each other. This facilitates cross-fertilization in which data collected across different domains help improving the learning performance at each other task. First, we present a simplified case in which the goal is to estimate the means of two Gaussian variables, for the purpose of gaining some insights on the advantage of the proposed cross-learning strategy. Then we provide a stochastic projected gradient algorithm to perform crosslearning over a generic loss function. If the number of parameters is large, then the projection step becomes computationally expensive. To avoid this situation, we derive a primal-dual algorithm that exploits the structure of the dual problem, achieving a formulation whose complexity only depends on the number of tasks. Preliminary numerical experiments for image classification by neural networks trained on a dataset divided in different domains corroborate that the cross-learned function outperforms both the task-specific and the consensus approaches.

show abstract

Section: Algorithm Constructionmentioning

confidence: 99%

“…In this paper, we take a constrained approach to the multi-task learning problem. Our formulation is based on the cross-learning framework [10,11]. Originally used for learning policies in a reinforcement learning scenario, nevertheless, its principles can be also applied to the multi-task supervised learning problem.…”

Section: Introductionmentioning

confidence: 99%