Proceedings of 2012 IEEE International Conference on Automation, Quality and Testing, Robotics 2012
DOI: 10.1109/aqtr.2012.6237681
|View full text |Cite
|
Sign up to set email alerts
|

Imitation learning with non-parametric regression

Abstract: Abstract-Humans are very fast learners. Yet, we rarely learn a task completely from scratch. Instead, we usually start with a rough approximation of the desired behavior and take the learning from there. In this paper, we use imitation to quickly generate a rough solution to a robotic task from demonstrations, supplied as a collection of state-space trajectories. Appropriate control actions needed to steer the system along the trajectories are then automatically learned in the form of a (nonlinear) statefeedba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 17 publications
(15 reference statements)
0
5
0
Order By: Relevance
“…where σ s is a factor determining the length of the running average that was set to a value of 0.95, as in [12]. A low value for η s means that the sample and its closest neighbors live in the neighborhood of the same affine hyperplane.…”
Section: B Memory Storagementioning
confidence: 99%
See 2 more Smart Citations
“…where σ s is a factor determining the length of the running average that was set to a value of 0.95, as in [12]. A low value for η s means that the sample and its closest neighbors live in the neighborhood of the same affine hyperplane.…”
Section: B Memory Storagementioning
confidence: 99%
“…The strategy implemented in [12] to reduce noise is to replace the nearest neighbors y s with the estimated values using the local modelĥ(x s ). The local model is a linear least squares solution of the nearest neighbors, which is the best average of the first order relation present in the neighbors.…”
Section: B Memory Storagementioning
confidence: 99%
See 1 more Smart Citation
“…Another method to improve the speed of learning is the use of human demonstrations to explore the state space of a system. Several attempts have been made to do so, 7,8 however, these attempts did use a model of the dynamics of the system at hand. A method that accelerates the reinforcement learning in a model-free and online fashion through the use of human demonstrations, has not yet been found.…”
Section: Introductionmentioning
confidence: 99%
“…Successful attempts have been made to include knowledge on the dynamics of the system to speed up the learning. 6,7 These attempts used different kinds of knowledge on the system dynamics to do so. Another method to improve the speed of learning is the use of human demonstrations to explore the state space of a system.…”
Section: Introductionmentioning
confidence: 99%