2021 IEEE International Conference on Robotics and Automation (ICRA) 2021
DOI: 10.1109/icra48506.2021.9561097
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Self-Supervised Data Collection for Offline Robot Learning

Abstract: A practical approach to robot reinforcement learning is to first collect a large batch of real or simulated robot interaction data, using some data collection policy, and then learn from this data to perform various tasks, using offline learning algorithms. Previous work focused on manually designing the data collection policy, and on tasks where suitable policies can easily be designed, such as random picking policies for collecting data about object grasping. For more complex tasks, however, it may be diffic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 16 publications
0
6
0
Order By: Relevance
“…Multi-task Offline RL. Recent works proposed to learn multiple tasks from pre-collected datasets, starting with methods [32] that generate goals to improve the offline data collection process in a selfsupervised way. This connection has also been studied in the supervised setting [9,33] and when learning hierarchical policies [10].…”
Section: Related Workmentioning
confidence: 99%
“…Multi-task Offline RL. Recent works proposed to learn multiple tasks from pre-collected datasets, starting with methods [32] that generate goals to improve the offline data collection process in a selfsupervised way. This connection has also been studied in the supervised setting [9,33] and when learning hierarchical policies [10].…”
Section: Related Workmentioning
confidence: 99%
“…A straightforward implementation of Data Generation consists in collecting data with the random uniform policy. However, this approach is not guaranteed to explore the state space effectively (Mutti et al, 2021;Endrawis et al, 2021). In the pipeline, we consider the state-of-the-art solutions proposed by Pathak et al (2019), and Mutti et al (2021).…”
Section: Data Generationmentioning
confidence: 99%
“…In [23], high-quality and diverse data are obtained with human demonstrations, which proves to broadly improve the performance of the robotic learning tasks. To reduce manual effort, efficient self-supervised data collection has been introduced for offline robot learning with task-relevant loss functions [24], [25]. Alternatively, online data collection without manual intervention can be achieved through robot interactions with the environments [26].…”
Section: B Data Collection and Sim-to-realmentioning
confidence: 99%