2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017
DOI: 10.1109/iros.2017.8205962
|View full text |Cite
|
Sign up to set email alerts
|

Neural networks for incremental dimensionality reduced reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…The main function of SL is to generate a low-dimensional state space in which RL policy can perform well and be efficient. The studies [25][26][27][28][29] adapt SL methods to make the RL training process faster by separating the representation learning process and policy learning process.…”
Section: Related Workmentioning
confidence: 99%
“…The main function of SL is to generate a low-dimensional state space in which RL policy can perform well and be efficient. The studies [25][26][27][28][29] adapt SL methods to make the RL training process faster by separating the representation learning process and policy learning process.…”
Section: Related Workmentioning
confidence: 99%
“…The purpose of learning a latent space embedding is to reduce the dimensionality of the input state-space the policy network learns from. As the policy network gathers experience by interacting with the environment in real time, a reduction in the dimensionality of the state space leads to accelerated convergence [18].…”
Section: Learning Latent Space Representationmentioning
confidence: 99%
“…The purpose of latent space learning is to reduce the time required for the policy network to reach convergence. As the policy network gathers observations in real time, it is costly to explore large state spaces, therefore a reduction in the dimensionality of the state space leads to accelerated convergence [16]. Desired characteristics of a dimensionality reduction technique is that it is smooth, continuous and consistent [17], so that a policy network can efficiently learn how to perceive the encoded environment.…”
Section: Learning Latent Space Representation -Cross-modal Variationa...mentioning
confidence: 99%