2022
DOI: 10.3390/electronics11213583
|View full text |Cite
|
Sign up to set email alerts
|

Online Adaptive Dynamic Programming-Based Solution of Networked Multiple-Pursuer and Single-Evader Game

Abstract: This paper presents a new scheme for the online solution of a networked multi-agent pursuit–evasion game based on an online adaptive dynamic programming method. As a multi-agent in the game can form an Internet of Things (IoT) system, by incorporating the relative distance and the control energy as the performance index, the expression of the policies when the agents reach the Nash equilibrium is obtained and proved by the minmax principle. By constructing a Lyapunov function, the capture conditions of the gam… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…On the optimization of database use, Crispim et al [19] proposed using the TOPSIS method to sort and select candidate partners; Wang Daoqu [20] combined AHP and TOPSIS to achieve partner selection; Gong et al [21] used the adaptive dynamic programming method to research and solve the online solution of the network multiagent pursuit and evasion game so that each agent can obtain the strategy to achieve Nash equilibrium in real time; Liang and Xu [22] established a fnite-time domain Markov decision process model with the goal of maximizing the benefts of the hospital in terms of inspection equipment, and combined it with the dynamic programming theory to obtain the optimal reservation scheduling strategy of the system. In order to adapt the warehouse to the increasing variety and quantity of storage products, Djurdjević et al [23] used the dynamic programming method to obtain the optimal allocation of products in diferent order-picking areas.…”
Section: Literature Reviewmentioning
confidence: 99%
“…On the optimization of database use, Crispim et al [19] proposed using the TOPSIS method to sort and select candidate partners; Wang Daoqu [20] combined AHP and TOPSIS to achieve partner selection; Gong et al [21] used the adaptive dynamic programming method to research and solve the online solution of the network multiagent pursuit and evasion game so that each agent can obtain the strategy to achieve Nash equilibrium in real time; Liang and Xu [22] established a fnite-time domain Markov decision process model with the goal of maximizing the benefts of the hospital in terms of inspection equipment, and combined it with the dynamic programming theory to obtain the optimal reservation scheduling strategy of the system. In order to adapt the warehouse to the increasing variety and quantity of storage products, Djurdjević et al [23] used the dynamic programming method to obtain the optimal allocation of products in diferent order-picking areas.…”
Section: Literature Reviewmentioning
confidence: 99%
“…A multiagent pursuit-evasion game requires that a set of agents persuade a different set of agents (evaders) to cooperate in order to be apprehended [16,17]. The playing eld, the players' game information and the players' ability to manage their movements Other crucial factors to take into account include the exibility of the evaders' manoeuvrability and the concept of a capture [18,19]. Another type of chase-evasion game involves the pursuit and evasion of players in a de ned setting, such as a grid map.…”
Section: Introductionmentioning
confidence: 99%