Curriculum Learning for Cumulative Return Maximization

Foglino, Francesco; Christakou, Christiano Coletto; Gutierrez, Ricardo Luna; Leonetti, Matteo

doi:10.24963/ijcai.2019/320

Cited by 6 publications

(5 citation statements)

References 3 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This section starts with the comparison between the Exponential progression and the Friction-Based progression. It then compares the performance of our approach with two other state-of-the-art Curriculum learning algorithms, as outlined in the Related Work section: Florensa et al [2] and Foglino et al [3].…”

Section: Resultsmentioning

confidence: 99%

“…The formulation of sequencing as a combinatorial optimization problem [4] over the intermediate tasks lends itself to the design of globally optimal sequencing algorithms. One such algorithm is Heuristic Task Sequencing for Cumulative Return [3] (HTS-CR), which is a complete anytime algorithm, converging to the optimal curriculum of a maximum length. Due to this guarantee of optimality, we use HTS-CR as one of the baselines to evaluate our sequencing method.…”

Section: Related Workmentioning

confidence: 99%

“…When using HTS-CR [3] as one of our baselines, we used transfer between tasks; in the case of the Grid World domain, the transfer method is the one outlined in the paper, on the other hand, on the Point Maze domain, the transfer method consisted in directly transferring the neural network from one task to the next. On both domains, the intermediate tasks used by HTS-CR were a subset of the tasks used by our approach; this gives some insight on the quality of the solution found by our approach compared to what is possible with a standard curriculum.…”

Section: Implementation Detailsmentioning

confidence: 99%

“…In order to show the potential of our framework, we compared the Frictionbased progression with other two state-of-the-art Curriculum Learning algorithms HTS-CR [3] and Reverse Curriculum Generation [2]. Figures 5 and 6 report the average and 95 percent confidence intervals on both test domains.…”

Section: Comparison With State-of-the-artmentioning

confidence: 99%

See 3 more Smart Citations

Curriculum Learning with a Progression Function

Bassich¹,

Foglino²,

Leonetti³

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

Curriculum Learning for Reinforcement Learning is an increasingly popular technique that involves training an agent on a defined sequence of intermediate tasks, called a Curriculum, to increase the agent's performance and learning speed. This paper introduces a novel paradigm for automatic curriculum generation based on a progression of task complexity. Different progression functions are introduced, including an autonomous online task progression based on the performance of the agent. The progression function also determines how long the agent should train on each intermediate task, which is an open problem in other task-based curriculum approaches. The benefits and wide applicability of our approach are shown by empirically comparing its performance to two state-of-the-art Curriculum Learning algorithms on a grid world and on a complex simulated navigation domain.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Implementation Detailsmentioning

confidence: 99%

Section: Comparison With State-of-the-artmentioning

confidence: 99%

See 2 more Smart Citations

Curriculum Learning with a Progression Function

Bassich¹,

Foglino²,

Leonetti³

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…While metaheuristic algorithms are broadly applicable, it is also possible to create specific heuristic search methods targeted at particular problems, such as task sequencing with a specific transfer metric objective. Foglino et al (2019b) introduce one such heuristic search algorithm, designed to optimize for the cumulative return. Their approach begins by computing transferability between all pairs of tasks, using a simulator to estimate the cumulative return attained by using one task as a source for another.…”

Section: Combinatorial Optimization and Searchmentioning

confidence: 99%

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Narvekar,

Peng,

Leonetti

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

Reinforcement learning (RL) is a popular paradigm for addressing sequential decision tasks in which the agent has only limited environmental feedback. Despite many advances over the past three decades, learning in many domains still requires a large amount of interaction with the environment, which can be prohibitively expensive in realistic scenarios. To address this problem, transfer learning has been applied to reinforcement learning such that experience gained in one task can be leveraged when starting to learn the next, harder task. More recently, several lines of research have explored how tasks, or data samples themselves, can be sequenced into a curriculum for the purpose of learning a problem that may otherwise be too difficult to learn from scratch. In this article, we present a framework for curriculum learning (CL) in reinforcement learning, and use it to survey and classify existing CL methods in terms of their assumptions, capabilities, and goals. Finally, we use our framework to find open problems and suggest directions for future RL curriculum learning research.

show abstract

An Optimization Framework for Task Sequencing in Curriculum Learning

Foglino

Christakou

Leonetti

2019

2019 Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

Self Cite

View full text Add to dashboard Cite

Curriculum learning in reinforcement learning is used to shape exploration by presenting the agent with increasingly complex tasks. The idea of curriculum learning has been largely applied in both animal training and pedagogy. In reinforcement learning, all previous task sequencing methods have shaped exploration with the objective of reducing the time to reach a given performance level. We propose novel uses of curriculum learning, which arise from choosing different objective functions. Furthermore, we define a general optimization framework for task sequencing and evaluate the performance of popular metaheuristic search methods on several tasks. We show that curriculum learning can be successfully used to: improve the initial performance, take fewer suboptimal actions during exploration, and discover better policies.

show abstract

Curriculum Learning for Cumulative Return Maximization

Cited by 6 publications

References 3 publications

Curriculum Learning with a Progression Function

Curriculum Learning with a Progression Function

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

An Optimization Framework for Task Sequencing in Curriculum Learning

Contact Info

Product

Resources

About