2021
DOI: 10.48550/arxiv.2103.16817
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 0 publications
0
8
0
Order By: Relevance
“…Related to challenges of generalisation surrounding reward functions, for many real-world problems designing good reward functions is very difficult. A promising approach is using inverse RL [168,169,IRL] to learn a reward function from human demonstrations [170,171,172,173], rather than hand-crafting reward functions for each task. This is often more time-efficient, as demonstrating a task is easier than specifying a reward function for it.…”
Section: Real World Reinforcement Learning Generalisationmentioning
confidence: 99%
“…Related to challenges of generalisation surrounding reward functions, for many real-world problems designing good reward functions is very difficult. A promising approach is using inverse RL [168,169,IRL] to learn a reward function from human demonstrations [170,171,172,173], rather than hand-crafting reward functions for each task. This is often more time-efficient, as demonstrating a task is easier than specifying a reward function for it.…”
Section: Real World Reinforcement Learning Generalisationmentioning
confidence: 99%
“…However, these require a large amount of training data. Recent works [5], [6] leverage action recognition models, such as the action classifiers trained on the 20BN Something-Something dataset [29], to identify whether the robot is performing the desired task. However, while these classifiers are useful for identifying the class of motions for short interactions, we show that they do not retain enough information to analyze tasks with longer duration and multiple repetitions.…”
Section: Learning From Human Demonstrationsmentioning
confidence: 99%
“…In this work, we address how robots can appropriately represent and learn periodic policies by watching humans. While prior works considered learning manipulation skills from human demonstrations [1]- [6], less attention has been given to periodic tasks. These tasks repeat similar motion with only small differences between repetitions.…”
Section: Introductionmentioning
confidence: 99%
“…and task learning that can generalize across multiple environments (How should I do it?). Recent works have just started to explored the use of pre-existing foundational models from language and vision towards robotics [9]- [13], and also the development of robotics-specific foundational models [8,14].…”
Section: Introductionmentioning
confidence: 99%