The 2010 International Joint Conference on Neural Networks (IJCNN) 2010
DOI: 10.1109/ijcnn.2010.5596468
|View full text |Cite
|
Sign up to set email alerts
|

Deep auto-encoder neural networks in reinforcement learning

Abstract: Abstract-This paper discusses the effectiveness of deep autoencoder neural networks in visual reinforcement learning (RL) tasks. We propose a framework for combining deep autoencoder neural networks (for learning compact feature spaces) with recently-proposed batch-mode RL algorithms (for learning policies). An emphasis is put on the data-efficiency of this combination and on studying the properties of the feature spaces automatically constructed by the deep auto-encoders. These feature spaces are empirically … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
149
0
1

Year Published

2011
2011
2020
2020

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 281 publications
(150 citation statements)
references
References 18 publications
0
149
0
1
Order By: Relevance
“…In reinforcement learning, for example, using adaptive unsupervised preprocessors have become increasingly popular (e.g., [22]). However, features generated by algorithms that forget what they learned before, may no longer be valid for parts of the environment the agent has not visited recently.…”
Section: Discussionmentioning
confidence: 99%
“…In reinforcement learning, for example, using adaptive unsupervised preprocessors have become increasingly popular (e.g., [22]). However, features generated by algorithms that forget what they learned before, may no longer be valid for parts of the environment the agent has not visited recently.…”
Section: Discussionmentioning
confidence: 99%
“…In the literature, this growing batch approach can be found in several different guises; the number of alternations between episodes of exploration and episodes of learning can be in the whole range of being as close to the pure batch approach as using only two iterations to recal-culating the policy after every few interactions-e.g. after finishing one episode in a shortest-path problem (Kalyanakrishnan and Stone, 2007;Lange and Riedmiller, 2010a). In practice, the growing batch approach is the modeling of choice when applying batch reinforcement learning algorithms to real systems.…”
Section: The Growing Batch Learning Problemmentioning
confidence: 99%
“…The DFQ algorithm has been successfully applied to learning visual control policies in a grid-world benchmark problem-using synthesized (Lange and Riedmiller, 2010b) as well as screen-captured images (Lange and Riedmiller, 2010a)-and to controlling a slot-car racer only on the basis of the raw image data captured by a top-mounted camera (Lange, 2010).…”
Section: Deep Fitted Q Iterationmentioning
confidence: 99%
“…Unsupervised learning of deep auto encoder network was integrated into batch-reinforcement learning in [2,3]. The near-optimal policy was demonstrated automatically by learned feature spaces in grid-world like task.…”
Section: Introductionmentioning
confidence: 99%