2002
DOI: 10.1541/ieejeiss1987.122.3_374
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning Using Adaptive Search Method

Abstract: We propose an adaptive probability density function (PDF) to select an effective action on reinforcement learning (RL). The uniform distribution function and the normal distribution function of an action are often used to select an action. When these functions are used, however, the information of search direction is not considered. The proposed method utilizing the information of it enables RL to reduce the number of trials, which is needed to real environment learning. Furthermore, the proposed method can be… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2004
2004
2006
2006

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 4 publications
0
1
0
Order By: Relevance
“…When using such a selection algorithm, the actions are discrete values. However, continuous-valued actions can be treated easily by using adaptive PDF [11] or other techniques.…”
Section: Action Selectionmentioning
confidence: 99%
“…When using such a selection algorithm, the actions are discrete values. However, continuous-valued actions can be treated easily by using adaptive PDF [11] or other techniques.…”
Section: Action Selectionmentioning
confidence: 99%