2008
DOI: 10.1007/s10514-008-9095-6
|View full text |Cite
|
Sign up to set email alerts
|

Geodesic Gaussian kernels for value function approximation

Abstract: The least-squares policy iteration approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular and useful choice as a basis function. However, it does not allow for discontinuity which typically arises in real-world reinforcement learning tasks. In this paper, we propose a new basis function based on geodesic Gaussian kernels, which exploits the non-linear manifold structure induced by the Markov decision processes. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
5
2
2

Relationship

3
6

Authors

Journals

citations
Cited by 27 publications
(18 citation statements)
references
References 17 publications
0
17
0
Order By: Relevance
“…Thus, accurately approximating the value function is a challenge in the value function based approach. So far, various machine learning techniques have been employed for better value function approximation, such as least-squares approximation [12], manifold learning [17], efficient sample reuse [6], active learning [2], and robust learning [16].…”
Section: Policy Iteration Vs Policy Searchmentioning
confidence: 99%
“…Thus, accurately approximating the value function is a challenge in the value function based approach. So far, various machine learning techniques have been employed for better value function approximation, such as least-squares approximation [12], manifold learning [17], efficient sample reuse [6], active learning [2], and robust learning [16].…”
Section: Policy Iteration Vs Policy Searchmentioning
confidence: 99%
“…Tobias and Daniel proposed a LSTD approach based on SVMs [96]. Several researchers have investigated designing specialized kernels that exploit manifold structure in the state space [90,91,59,60,10,9,87]. This work represents exciting progress; however, the field of kernel-based ADP has developed only recently, and there remain numerous possibilities that are yet unexplored.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Investigation of manifold-based kernels, and their relationship to n-stage BRE A number of researchers have proposed using kernels that exploit manifold structure on the state space as a means of devising cost approximation algorithms [90,91,59,60,10,9,87]. We believe that these kernels are particularly appropriate for use in our BRE algorithms, and propose to test these manifold-based kernels in several BRE test problems.…”
Section: Further Bre Algorithm Development/extensionmentioning
confidence: 99%
“…Recently, more sophisticated methods of constructing suitable basis functions have been proposed, which effectively make use of the graph structure induced by MDPs [5]. In this section, we introduce a novel way of constructing basis functions by incorporating the graph structure; while relation to the existing graph-based methods is discussed in the separate report [14].…”
Section: Gaussian Kernels On Graphsmentioning
confidence: 99%