2011
DOI: 10.1177/0278364911428653
|View full text |Cite
|
Sign up to set email alerts
|

Robot learning from demonstration by constructing skill trees

Abstract: We describe CST, an online algorithm for constructing skill trees from demonstration trajectories. CST segments a demonstration trajectory into a chain of component skills, where each skill has a goal and is assigned a suitable abstraction from an abstraction library. These properties permit skills to be improved efficiently using a policy learning algorithm. Chains from multiple demonstration trajectories are merged into a skill tree. We show that CST can be used to acquire skills from human demonstration in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
194
0
1

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 262 publications
(198 citation statements)
references
References 30 publications
3
194
0
1
Order By: Relevance
“…There is some recent work in interpreting human actions and interaction with objects [25,1,17] in context of learning to perform actions from demonstrations. Lopes et al [25] use context from objects in terms of possible grasp a↵ordances to focus the attention of their recognition system.…”
Section: Related Workmentioning
confidence: 99%
“…There is some recent work in interpreting human actions and interaction with objects [25,1,17] in context of learning to perform actions from demonstrations. Lopes et al [25] use context from objects in terms of possible grasp a↵ordances to focus the attention of their recognition system.…”
Section: Related Workmentioning
confidence: 99%
“…Aldoma et al [2] proposed a method to find affordances which depends solely on the objects of interest and their position and orientation in the scene. There is some recent work in interpreting human actions and interaction with objects [26,1,20] in context of learning to perform actions from demonstrations. Lopes et al [26] use context from objects in terms of possible grasp affordances to focus the attention of their recognition system.…”
Section: Related Workmentioning
confidence: 99%
“…Clustering was also used to cluster subgoals to prevent the creation of multiple options that all correspond to the same underlying skill (Niekum and Barto 2011). Konidaris and colleagues (Konidaris and Barto 2009b, Konidaris et al 2011a, 2012b illustrated the utility of setting the goal of an option to be reaching the initiation set of an already-formed option in a process called "skill chaining." This method is used in the example described in Section 5.1 below.…”
Section: Hierarchical Reinforcement Learningmentioning
confidence: 99%