2013
DOI: 10.1007/s10489-013-0424-x
|View full text |Cite
|
Sign up to set email alerts
|

On incorporating the paradigms of discretization and Bayesian estimation to create a new family of pursuit learning automata

Abstract: There are currently two fundamental paradigms that have been used to enhance the convergence speed of Learning Automata (LA). The first involves the concept of utilizing the estimates of the reward probabilities, while the second involves discretizing the probability space in which the LA operates. This paper demonstrates how both of these can be simultaneously utilized, and in particular, by using the family of Bayesian estimates that have been proven to have distinct advantages over their maximum likelihood … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 34 publications
(24 citation statements)
references
References 31 publications
0
24
0
Order By: Relevance
“…During the last decade, SE ri has been considered as the state-of-art algorithm for a long time, however, some recently proposed algorithms [8], [9] claim a faster convergence than SE ri . To make a comprehensive comparison among currently available techniques, as well as to verify the effectiveness of the proposed parameter-free scheme, in this section, PFLA is compared with several classic parameter-based learning automata schemes, including DP ri [20], DGPA [4], DBPA [6], DGCPA * [8], SE ri [5], GBSE [9] and LELA R [7].…”
Section: Simulation Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…During the last decade, SE ri has been considered as the state-of-art algorithm for a long time, however, some recently proposed algorithms [8], [9] claim a faster convergence than SE ri . To make a comprehensive comparison among currently available techniques, as well as to verify the effectiveness of the proposed parameter-free scheme, in this section, PFLA is compared with several classic parameter-based learning automata schemes, including DP ri [20], DGPA [4], DBPA [6], DGCPA * [8], SE ri [5], GBSE [9] and LELA R [7].…”
Section: Simulation Resultsmentioning
confidence: 99%
“…In [6], DBPA was proposed where the posterior distribution of estimatedĉ i is represented by a beta distribution Beta(α, β), the parameter α and β record the number of times that a specific action has been rewarded and penalized respectively. Then the 95 th percentile of the cumulative posterior distribution is utilized as estimation of c i .…”
Section: ) a Comprehensive Comparison Among Recently Proposedmentioning
confidence: 99%
See 1 more Smart Citation
“…Table 3 shows the comparison between the t 0 calculated from (18) and the average number of iterations needed for the LA to converge, in practice. As the CPA and ACPA are well-established algorithms, we know that numerous experiments have been conducted to confirm their validity, and so we merely use the figures from [30] to record the practical results.…”
Section: Remarkmentioning
confidence: 99%
“…In contrast to any LA scheme presented in the literature, our solution is especially designed for deterministic environments. & Our current solution extends the family of pursuit LA algorithms [3,38,39] to solve deterministic optimization problems. A common feature for all legacy pursuit algorithms is to estimate the reward probability of each action and pursue the action with the highest reward.…”
Section: Introductionmentioning
confidence: 99%