2004
DOI: 10.1016/j.neunet.2004.05.004
|View full text |Cite
|
Sign up to set email alerts
|

Reliability of internal prediction/estimation and its application. I. Adaptive action selection reflecting reliability of value function

Abstract: This article proposes an adaptive action-selection method for a model-free reinforcement learning system, based on the concept of the 'reliability of internal prediction/estimation'. This concept is realized using an internal variable, called the Reliability Index (RI), which estimates the accuracy of the internal estimator. We define this index for a value function of a temporal difference learning system and substitute it for the temperature parameter of the Boltzmann action-selection rule. Accordingly, the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2006
2006
2015
2015

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 26 publications
0
7
0
Order By: Relevance
“…A literature review shows that this study is the first to use a nationwide population-based database for training and testing a neural network to predict HCC surgery outcomes. Unlike other standard statistical models, ANNs can also manage complexity even with small samples and with an unbalanced ratio between variables and records [8][9][10]. That is, ANNs overcome the problem of dimensionality.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…A literature review shows that this study is the first to use a nationwide population-based database for training and testing a neural network to predict HCC surgery outcomes. Unlike other standard statistical models, ANNs can also manage complexity even with small samples and with an unbalanced ratio between variables and records [8][9][10]. That is, ANNs overcome the problem of dimensionality.…”
Section: Discussionmentioning
confidence: 99%
“…Although many different ANNs have been developed, one of the most common structures consists of an interconnected group of nodes in multiple layers, in which input nodes and output nodes have clinical correlates [9,10]. The nodes are connected by links, each of which has an associated weight.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…In [38], the authors employed an interesting strategy of internal prediction/estimation to control the balance between exploration and exploitation for efficient adaptability of an agent in a new environment. Here, a reliability parameter RI has been introduced as an internal variable to estimate the 'expected prediction error'.…”
Section: Introductionmentioning
confidence: 99%