Encyclopedia of Robotics 2018
DOI: 10.1007/978-3-642-41610-1_27-1
|View full text |Cite
|
Sign up to set email alerts
|

Learning from Demonstration (Programming by Demonstration)

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(29 citation statements)
references
References 43 publications
0
29
0
Order By: Relevance
“…Although our platform supports controlling both arms and the head, for simplicity we only subjected the right arm to control and froze all other joints except when resetting to initial states. 2 During execution, the policy π θ generates the control u t = π θ (o t ) given current observation o t . Observations and controls are both collected at 10 Hz.…”
Section: A Neural Network Control Policiesmentioning
confidence: 99%
See 1 more Smart Citation
“…Although our platform supports controlling both arms and the head, for simplicity we only subjected the right arm to control and froze all other joints except when resetting to initial states. 2 During execution, the policy π θ generates the control u t = π θ (o t ) given current observation o t . Observations and controls are both collected at 10 Hz.…”
Section: A Neural Network Control Policiesmentioning
confidence: 99%
“…Imitation learning is a class of methods for acquiring skills by observing demonstrations (see, e.g., [1], [2], [3] for surveys). It has been applied successfully to a wide range of domains in robotics, for example to autonomous driving [4], [5], [6], autonomous helicopter flight [7], gesturing [8], and manipulation [9], [10].…”
Section: Introductionmentioning
confidence: 99%
“…D ← D ∪ GetDemonstrations(d) 4: θ * ← arg min θ L(D, θ) #relearn 5: t ← 0 # re-start episode in current context 6: Ω = AverageUncertainty() # adapt query threshold 7: a t , σ t =π θ * (x t ) # re-select action 8 Fig. 1: The controller has to perform well on all tasks it faces sequentially with limited requests for task-specific demonstrations.…”
Section: Algorithm 1 Select Action and Train If Necessarymentioning
confidence: 99%
“…Prior work in automation has explored learning from demonstrations for highly unstructured tasks such as grasping in clutter, scooping, and pipetting [16], [19]. Past work has also addressed the specific problem of learning from demonstrations under constraints [4], [5]. A popular method for dealing with unknown constraints is to identify essential components of multiple successful trajectories based on variances in the corresponding states and then to produce a learned policy that also exhibits those components [6].…”
Section: Related Work Learning From Demonstrations In Automation mentioning
confidence: 99%