2002
DOI: 10.1109/tsmcb.2002.1049608
|View full text |Cite
|
Sign up to set email alerts
|

Generalized pursuit learning schemes: new families of continuous and discretized learning automata

Abstract: Abstract-The fastest learning automata (LA) algorithms currently available fall in the family of estimator algorithms introduced by Thathachar and Sastry [24]. The pioneering work of these authors was the pursuit algorithm, which pursues only the current estimated optimal action. If this action is not the one with the minimum penalty probability, this algorithm pursues a wrong action. In this paper, we argue that a pursuit scheme that generalizes the traditional pursuit algorithm by pursuing all the actions wi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
166
0

Year Published

2004
2004
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 152 publications
(166 citation statements)
references
References 44 publications
0
166
0
Order By: Relevance
“…1 This sections presents approaches for to predicting p(t i , n). The first approach is a straightforward solution introduced for comparison purposes.…”
Section: Problem Setting and Modellingmentioning
confidence: 99%
See 3 more Smart Citations
“…1 This sections presents approaches for to predicting p(t i , n). The first approach is a straightforward solution introduced for comparison purposes.…”
Section: Problem Setting and Modellingmentioning
confidence: 99%
“…Learning Automata (LA) have been used in systems that have incomplete knowledge of the environment in which they operate [1,18,25,26,27,32,39]. The learning mechanism attempts to learn from a stochastic Teacher which models the Environment.…”
Section: Meta-learning Algorithm Based On Learning Automatamentioning
confidence: 99%
See 2 more Smart Citations
“…As opposed to this, our scheme resorts to discretizing the probability space [1,5,9,16], and performing a controlled random walk on this discretized space. It is well known in the field of LA that discretized schemes achieve faster convergence speed than continuous schemes [1,8]. By virtue of discretization, our estimator realizes fast adjustments of the running estimates by jumps, and it is thus able to robustly track changes in the parameters of the distribution after a switch has occurred in the environment.…”
Section: State-of-the-artmentioning
confidence: 99%