2018
DOI: 10.48550/arxiv.1802.01557
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
75
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 72 publications
(75 citation statements)
references
References 36 publications
0
75
0
Order By: Relevance
“…While most prior work on deep visuomotor learning trains a single task in a single domain [7], [8], [9], [10], [11], [12], [13], [14], [15], our goal is not to develop better learning methods, but rather to illustrate how generic multi-domain, multi-task datasets can be used with existing algorithms to boost the generalization of new tasks in new domains. Prior work on multi-task reinforcement learning [5] has shown that data from other tasks can boost generalization of new tasks, however this study is carried out in a single domain.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…While most prior work on deep visuomotor learning trains a single task in a single domain [7], [8], [9], [10], [11], [12], [13], [14], [15], our goal is not to develop better learning methods, but rather to illustrate how generic multi-domain, multi-task datasets can be used with existing algorithms to boost the generalization of new tasks in new domains. Prior work on multi-task reinforcement learning [5] has shown that data from other tasks can boost generalization of new tasks, however this study is carried out in a single domain.…”
Section: Related Workmentioning
confidence: 99%
“…We provide an overview of the most related datasets in Figure 2. Most existing robot datasets, such as MIME [16], DAML [10], RoboTurk [17], [18], and many others [19], [20], [21], [22], [23], [24], [5] only feature a single domain, making them difficult to use for boosting the generalization in other domains. Merging multiple existing datasets into one multi-domain dataset is difficult due to inconsistencies in data collection protocols, time discretization, robot morphologies, and sensors.…”
Section: Related Workmentioning
confidence: 99%
“…The trajectories are recorded using either kinesthetic teaching [56] or teleoperation [57]. Follow-up papers [58], [55] adopt deep learning approaches to solve the task. Similarly, Yu et al [58] teach the robot to perform tasks from a demonstration video.…”
Section: B Model-free Graspingmentioning
confidence: 99%
“…Follow-up papers [58], [55] adopt deep learning approaches to solve the task. Similarly, Yu et al [58] teach the robot to perform tasks from a demonstration video. In this work, the robot policy is directly predicted from hidden layers of the network.…”
Section: B Model-free Graspingmentioning
confidence: 99%
“…Hand-designed unsupervised losses have been used to adapt to distribution shifts with rotation-prediction [47] or entropyminimization [53], as well as to encode inductive biases [3]. Unsupervised losses have also been metalearned for learning to encode human demonstrations [57], few-shot learning exploiting unsupervised information [31,4], and learning to adapt to group distribution shifts [58]. In contrast, our unsupervised loss only takes the single query we care about, thus imposing no additional assumptions on top of standard prediction problems, and takes the form of a conservation law for predicted sequences.…”
Section: Related Workmentioning
confidence: 99%