A kinodynamic planning-learning algorithm for complex robot motor control

Gonzalez-Quijano, Javier; Abderrahim, Mohamed; Fernández, Fernando; Bensalah, Choukri

doi:10.1109/eais.2012.6232809

Cited by 2 publications

(1 citation statement)

References 9 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Nonetheless, the scalability of this class of algorithms to high dimensional state and action spaces is still a matter of study, more in the case of continues state-action spaces. To overcome this problem, the authors of this publication proposed in previous work a new model-based learning algorithm, called KiPLA [8], which handles much more efficiently the curse of dimensionality problem. This algorithm mixes kinodynamic planning and model learning for the purpose of finding suboptimal open-loop policies for achieving a certain task.…”

Section: Related Workmentioning

confidence: 97%

RoMPLA: An efficient robot motion and planning learning architecture

Gonzalez-Quijano

Abderrahim

Bensalah

et al. 2013

2013 IEEE/RSJ International Conference on Intelligent Robots and Systems

Self Cite

View full text Add to dashboard Cite

Robot motor skill learning is currently one of the most active research areas in robotics. Many learning techniques have been developed for relatively simple problems. However, very few of them have direct applicability in complex robotics systems without assuming prior knowledge about the task due to two facts. On one hand, they scale badly to continues and high dimensional problems. On the other hand, they require too many real learning episodes. In this sense, this paper provides a detailed description of an original approach capable of learning from scratch suboptimal solutions and of providing closed-loop motor control policies in the proximity of such solutions. The developed architecture manages the solution in two consecutive phases. The first phase provides an initial openloop solution state-action trajectory by mixing kinodynamic planning with model learning. In the second phase, the initial state trajectory solution is first smoothed and then, a closedloop controller with active learning capabilities is learned in its proximity. We will demonstrate the efficiency of this two phases approach in the Cart-Pole Swing-Up Task problem.

show abstract

Section: Related Workmentioning

confidence: 97%