2011
DOI: 10.1109/tsmcb.2011.2118749
|View full text |Cite
|
Sign up to set email alerts
|

Decentralized Indirect Methods for Learning Automata Games

Abstract: We discuss the application of indirect learning methods in zero-sum and identical payoff learning automata games. We propose a novel decentralized version of the well-known pursuit learning algorithm. Such a decentralized algorithm has significant computational advantages over its centralized counterpart. The theoretical study of such a decentralized algorithm requires the analysis to be carried out in a nonstationary environment. We use a novel bootstrapping argument to prove the convergence of the algorithm.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
19
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(19 citation statements)
references
References 35 publications
0
19
0
Order By: Relevance
“…LA selects the optimal one through interacting with an environment that provides an appropriate response (reward or penalty), which is used to update the selection probability vector of actions, as described in Algorithm 1. In recent years, LAs have been successfully applied to systems that possess incomplete knowledge about the environment, such as game playing [7], decision analysis [13], multiconstraint assignment [6], complete L-fuzzy matrix [14], and object partitioning [15]. Furthermore, LAs have been used to solve behavior recognition problems [16], stabilize distributed queuing systems [17] and continuous function optimization [18].…”
Section: A La and Continuous Pursuit Reward-inaction (Cp Ri ) Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…LA selects the optimal one through interacting with an environment that provides an appropriate response (reward or penalty), which is used to update the selection probability vector of actions, as described in Algorithm 1. In recent years, LAs have been successfully applied to systems that possess incomplete knowledge about the environment, such as game playing [7], decision analysis [13], multiconstraint assignment [6], complete L-fuzzy matrix [14], and object partitioning [15]. Furthermore, LAs have been used to solve behavior recognition problems [16], stabilize distributed queuing systems [17] and continuous function optimization [18].…”
Section: A La and Continuous Pursuit Reward-inaction (Cp Ri ) Algorithmmentioning
confidence: 99%
“…Each LA is responsible for learning a variable, such as a high dimensional parameter or a task in scheduling problem, etc. In order to supply a learning model that consists of many LAs that learn at the same time, decentralized learning automata (DLA) [6] [7] are presented. In general, those LAs in DLA are VSSA.…”
Section: Introductionmentioning
confidence: 99%
“…We use the Decentralized Pursuit Learning game Algorithm (DPLA) [13]. In the DPLA, each participating automata maintains a vectorˆ ( ) where k refers to the current trial and 1 ≤ ≤ is the automaton index.…”
Section: Decentralized Pursuit Learning Game Algorithmmentioning
confidence: 99%
“…the adaption to the environment. LAs have found a broad range of applications reported in the literature, such as game playing [2]- [4], pattern recognition [5], classification [6], knapsack problem [7], tutorial-like system [8], [9], object partitioning [10]- [13], cellular automata [14], telephony routing [15], [16], scheduling [17], minimum-spanning circle problem [18], congestion avoidance [19], function optimization [20], [21], resource allocation and assignment problems [22]- [24], automaton controller [25], control absorption columns, flexible manufacturing plants, and other applications such as dryers, vehicles, irrigation canals, multimedia network, robots, liquid-liquid extraction columns, bioreactors, distributed fuzzy logic processors, image processing, and data compression. Various LAs, their properties and applications have been reviewed in survey papers [26], [27] and books [28]- [32].…”
Section: Introductionmentioning
confidence: 99%