2012
DOI: 10.1007/978-3-642-35506-6_29
|View full text |Cite
|
Sign up to set email alerts
|

Binary Space Partitioning as Intrinsic Reward

Abstract: An autonomous agent embodied in a humanoid robot, in order to learn from the overwhelming flow of raw and noisy sensory, has to effectively reduce the high spatial-temporal data dimensionality. In this paper we propose a novel method of unsupervised feature extraction and selection with binary space partitioning, followed by a computation of information gain that is interpreted as intrinsic reward, then applied as immediate-reward signal for the reinforcement-learning. The space partitioning is executed by tin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
2
1
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 5 publications
0
6
0
Order By: Relevance
“…The following discussion of reinforcement-learning and binary space partitioning is an abbreviated version of a more detailed presentation by Skaba (2012).…”
Section: Evaluating the Conceptsmentioning
confidence: 99%
“…The following discussion of reinforcement-learning and binary space partitioning is an abbreviated version of a more detailed presentation by Skaba (2012).…”
Section: Evaluating the Conceptsmentioning
confidence: 99%
“…The application of binary space-partitioning for the estimation of the immediate-reward was first introduced in [6]. Fig.…”
Section: B Binary Space-partitioning and Computation Of The Rewardmentioning
confidence: 99%
“…The mean reward, rather than the reward provided by a positive example, is interpreted as immediate-reward for TDlearning, for we want to estimate the value of a concept. Self-information is also a special case of Kullback-Leibler distance from a Kronecker delta representing the matching pattern to the probability distribution [6], [8].…”
Section: B Binary Space-partitioning and Computation Of The Rewardmentioning
confidence: 99%
See 2 more Smart Citations