2017
DOI: 10.1016/j.artint.2015.09.013
|View full text |Cite
|
Sign up to set email alerts
|

Scalable transfer learning in heterogeneous, dynamic environments

Abstract: Reinforcement learning is a plausible theoretical basis for developing self-learning, autonomous agents or robots that can effectively represent the world dynamics and efficiently learn the problem features to perform different tasks in different environments. The computational costs and complexities involved, however, are often prohibitive for real-world applications. This study introduces a scalable methodology to learn and transfer knowledge of the transition (and reward) models for model-based reinforcemen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 29 publications
0
8
0
Order By: Relevance
“…Papers [13] and [16] also deal with learning, but put a strong emphasis on managing the complexity of doing so, thus addressing RC3. [13] introduces a scalable methodology to learn and transfer knowledge of the transition (and reward) models for model-based reinforcement learning in a complex world.…”
Section: Papers Focusing On Ca1: Robots That Knowmentioning
confidence: 99%
See 1 more Smart Citation
“…Papers [13] and [16] also deal with learning, but put a strong emphasis on managing the complexity of doing so, thus addressing RC3. [13] introduces a scalable methodology to learn and transfer knowledge of the transition (and reward) models for model-based reinforcement learning in a complex world.…”
Section: Papers Focusing On Ca1: Robots That Knowmentioning
confidence: 99%
“…[13] introduces a scalable methodology to learn and transfer knowledge of the transition (and reward) models for model-based reinforcement learning in a complex world. The authors use a formulation of Markov decision processes that support efficient online-learning of relevant problem features in order to approximate world dynamics.…”
Section: Papers Focusing On Ca1: Robots That Knowmentioning
confidence: 99%
“…It combines the improved TOPSIS (technique for order preference similar to an ideal solution) algorithm in terms of multi-objective decision and the multi-step backtracking Q(λ) algorithm about random optimization ability, so that it is remarkably applied in solving of real-time dynamic control issues of active loads. (c) TRL [66][67][68][69]. It is a novel algorithm based on a high integration of multi-agent collaboration, reinforcement learning and transfer learning in term of an efficient information utilization of historical optimization tasks, perceptibly can be applied in field of fast dynamic optimization of active loads.…”
Section: Situations Of Distributed Power Generationmentioning
confidence: 99%
“…The nature of knowledge should be ex plainable and transferable to other agents. In [3], the authors point out that there are still many open problems and assumptions to address, where a transfer learning framework would serve as an intelligent base platform for monitoring, problem solving, and general decision support in heterogeneous dynamic environments, leading to a new generation of human-interactive robots.…”
mentioning
confidence: 99%