2007
DOI: 10.1109/robot.2007.363573
|View full text |Cite
|
Sign up to set email alerts
|

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

Abstract: Abstract-The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular and useful choice as a basis function. However, it does not allow for discontinuity which typically arises in realworld reinforcement learning tasks. In this paper, we propose a new basis function based on geodesic Gaussian kernels, which exploits the non-linear manifold structure induced by the Markov decision processes. The usefu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2008
2008
2021
2021

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 19 publications
0
5
0
Order By: Relevance
“…In this section, we apply our method to three benchmark (simulated) robot motor control problems: (a) pendulum swing-up with limited torque (Doya, 2000), (b) robot arm control with obstacles (Sugiyama et al, 2007), and (c) multi-degrees of freedom (DOF) redundant arm reaching task (Theodorou et al, 2010). Figure 1 illustrates these problems.…”
Section: Simulation Resultsmentioning
confidence: 99%
“…In this section, we apply our method to three benchmark (simulated) robot motor control problems: (a) pendulum swing-up with limited torque (Doya, 2000), (b) robot arm control with obstacles (Sugiyama et al, 2007), and (c) multi-degrees of freedom (DOF) redundant arm reaching task (Theodorou et al, 2010). Figure 1 illustrates these problems.…”
Section: Simulation Resultsmentioning
confidence: 99%
“…Tobias and Daniel proposed a LSTD approach based on SVMs [96]. Several researchers have investigated designing specialized kernels that exploit manifold structure in the state space [90,91,59,60,10,9,87]. This work represents exciting progress; however, the field of kernel-based ADP has developed only recently, and there remain numerous possibilities that are yet unexplored.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Investigation of manifold-based kernels, and their relationship to n-stage BRE A number of researchers have proposed using kernels that exploit manifold structure on the state space as a means of devising cost approximation algorithms [90,91,59,60,10,9,87]. We believe that these kernels are particularly appropriate for use in our BRE algorithms, and propose to test these manifold-based kernels in several BRE test problems.…”
Section: Further Bre Algorithm Development/extensionmentioning
confidence: 99%
“…The current paper is a complete version of our earlier manuscript (Sugiyama et al 2007). The major differences are that we included more technical details of the proposed method in Sect.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we propose a new basis function based on geodesic Gaussian kernels, which exploits the non-linear manifold structure induced by the Markov decision processes. The usefulness of the proposed method is successfully demonstrated in simulated robot arm control and Khepera robot navigation.The current paper is a complete version of our earlier manuscript (Sugiyama et al 2007). The major differences are that we included more technical details of the proposed method in Sect.…”
mentioning
confidence: 99%