2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017
DOI: 10.1109/iros.2017.8202134
|View full text |Cite
|
Sign up to set email alerts
|

Virtual-to-real deep reinforcement learning: Continuous control of mobile robots for mapless navigation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
475
0
6

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 700 publications
(484 citation statements)
references
References 8 publications
3
475
0
6
Order By: Relevance
“…In robotics, simulations can be employed as an additional source of data [50], [51]. Physics simulators have been extensively developed for fields such as computer graphics or video gaming and one could potentially generate a vast amount of data.…”
Section: Motivating Examplesmentioning
confidence: 99%
“…In robotics, simulations can be employed as an additional source of data [50], [51]. Physics simulators have been extensively developed for fields such as computer graphics or video gaming and one could potentially generate a vast amount of data.…”
Section: Motivating Examplesmentioning
confidence: 99%
“…A. Reinforcement learning in robotics with simulations This work was motivated by the popularity of using reinforcement learning in robotics, despite RL being known to require large number of training samples and thus making it difficult to apply to robotics [9]. Part of this RL plus robotics work focuses on training policies in simulations and then transferring them to real robot, with or without further training on the robot [1].…”
Section: Arxiv:190500741v1 [Cslg] 2 May 2019mentioning
confidence: 99%
“…The "confidence value" motioned above of the actor is the degree of confirmation on which action the robot chooses to perform. For example, in a piece of sample from training data, the private Network 1 evaluates Q-values of different actions to (85, 85, 84, 83, 86), but the evaluation of the k-G sharing network is (20,20,100,10,10). In this case, we are more confident on actor of k-G sharing network, because it has significant differentiation in the scoring process.…”
Section: B Knowledge Fusion Algorithm In Cloudmentioning
confidence: 99%