2016
DOI: 10.1007/978-3-662-49831-6_100
|View full text |Cite
|
Sign up to set email alerts
|

A Parameter-Free Gradient Bayesian Two-Action Learning Automaton Scheme

Abstract: Reinforcement learning is one of the subjects of Artificial Intelligence and learning automata have been considered as one of the most powerful tools in this research area. A learning automaton (LA) is a learning machine that can learn an optimal action through interacting with stochastic environments. However, its performance highly depends on the selection of a configurable learning speed. Granmo proposed a Bayesian Learning Automaton (BLA), which is reported to not rely on such external parameters, to solve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 16 publications
0
11
0
Order By: Relevance
“…An important feature of learning systems is the ability to improve their efficiency over time. In mathematical terms, it can be stated that the purpose of a learning system is to optimize a task that is not well known [12,13]. Therefore, an approach to this problem is to reduce the goals of the learning system to an optimization problem, which is defined on a set of parameters and aims to find a set of optimal parameters.…”
Section: Definitionmentioning
confidence: 99%
“…An important feature of learning systems is the ability to improve their efficiency over time. In mathematical terms, it can be stated that the purpose of a learning system is to optimize a task that is not well known [12,13]. Therefore, an approach to this problem is to reduce the goals of the learning system to an optimization problem, which is defined on a set of parameters and aims to find a set of optimal parameters.…”
Section: Definitionmentioning
confidence: 99%
“…So extra efforts are necessary to realize the trade-off between the accuracy and the convergence rate in a specific environment. Most traditional schemes are parameter-sensitive, and the cost of parameter tuning can be extremely expensive [28]. In practical applications, especially where interacting with the environment could be expensive, the enormous cost for parameter tuning is intimidating.…”
Section: Introductionmentioning
confidence: 99%
“…Several parameter-free schemes have been proposed in recent years to address the problem of parameter tuning. The parameter-free concept, which is first presented in [29], indicates that a set of parameters can be universally applied to all environments without further tuning. The most representative parameter-free schemes are the parameter-free LA (PFLA) [28] and loss function-based LA (LFPLA) [30].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The most prominent features of learning-based systems are that they improve themselves over time. In mathematical terms, it can be stated that the purpose of a learning system is to optimize a task that is not well known [3,4]. Therefore, an approach to this problem is to reduce the goals of the learning system to an optimization problem, which is defined on a set of parameters and aims to find a set of optimal (appropriate) parameters.…”
Section: Introductionmentioning
confidence: 99%