Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence 2019
DOI: 10.24963/ijcai.2019/320
|View full text |Cite
|
Sign up to set email alerts
|

Curriculum Learning for Cumulative Return Maximization

Abstract: Curriculum learning has been successfully used in reinforcement learning to accelerate the learning process, through knowledge transfer between tasks of increasing complexity. Critical tasks, in which suboptimal exploratory actions must be minimized, can benefit from curriculum learning, and its ability to shape exploration through transfer. We propose a task sequencing algorithm maximizing the cumulative return, that is, the return obtained by the agent across all the learning episodes. By maximizing the cumu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 3 publications
0
5
0
Order By: Relevance
“…This section starts with the comparison between the Exponential progression and the Friction-Based progression. It then compares the performance of our approach with two other state-of-the-art Curriculum learning algorithms, as outlined in the Related Work section: Florensa et al [2] and Foglino et al [3].…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…This section starts with the comparison between the Exponential progression and the Friction-Based progression. It then compares the performance of our approach with two other state-of-the-art Curriculum learning algorithms, as outlined in the Related Work section: Florensa et al [2] and Foglino et al [3].…”
Section: Resultsmentioning
confidence: 99%
“…The formulation of sequencing as a combinatorial optimization problem [4] over the intermediate tasks lends itself to the design of globally optimal sequencing algorithms. One such algorithm is Heuristic Task Sequencing for Cumulative Return [3] (HTS-CR), which is a complete anytime algorithm, converging to the optimal curriculum of a maximum length. Due to this guarantee of optimality, we use HTS-CR as one of the baselines to evaluate our sequencing method.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…While metaheuristic algorithms are broadly applicable, it is also possible to create specific heuristic search methods targeted at particular problems, such as task sequencing with a specific transfer metric objective. Foglino et al (2019b) introduce one such heuristic search algorithm, designed to optimize for the cumulative return. Their approach begins by computing transferability between all pairs of tasks, using a simulator to estimate the cumulative return attained by using one task as a source for another.…”
Section: Combinatorial Optimization and Searchmentioning
confidence: 99%