Abstract:Abstract. Value function approximation is a critical task in solving Markov decision processes and accurately representing reinforcement learning agents. A significant issue is how to construct efficient feature spaces from agent's samples in order to obtain optimal policy. This study addresses this challenge by proposing an online kernel-based clustering approach for building appropriate basis functions during the learning process. The method uses a kernel function capable of handling pairs of state-action as… Show more
Set email alert for when this publication receives citations?
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.