Online Adaptive Dynamic Programming-Based Solution of Networked Multiple-Pursuer and Single-Evader Game

Gong, Zifeng; He, Bing; Chen, Hu; Zhang, Xiaobo; Kang, Weijie

doi:10.3390/electronics11213583

Cited by 6 publications

(2 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…On the optimization of database use, Crispim et al [19] proposed using the TOPSIS method to sort and select candidate partners; Wang Daoqu [20] combined AHP and TOPSIS to achieve partner selection; Gong et al [21] used the adaptive dynamic programming method to research and solve the online solution of the network multiagent pursuit and evasion game so that each agent can obtain the strategy to achieve Nash equilibrium in real time; Liang and Xu [22] established a fnite-time domain Markov decision process model with the goal of maximizing the benefts of the hospital in terms of inspection equipment, and combined it with the dynamic programming theory to obtain the optimal reservation scheduling strategy of the system. In order to adapt the warehouse to the increasing variety and quantity of storage products, Djurdjević et al [23] used the dynamic programming method to obtain the optimal allocation of products in diferent order-picking areas.…”

Section: Literature Reviewmentioning

confidence: 99%

Research on Partner Selection of High-Speed Railway Dynamic Logistics Alliance Based on the Dynamic Programming Model

Wang

Zhang

et al. 2023

Journal of Advanced Transportation

View full text Add to dashboard Cite

Traditional railway transportation can no longer meet people’s demand for logistics services. This paper takes advantage of high speed to propose a high-speed railway dynamic logistics alliance based on a cloud platform to make up for the lack of transport capacity at both ends of the high-speed railway logistics trunk line. Selecting partners is crucial to the high-speed rail logistics alliance. This paper uses the methods of multiobjective fuzzy optimization and dynamic programming to conduct multistage optimization of high-speed railway dynamic logistics alliance partners. When the market demand changes, in order to optimize the overall interests of the alliance, this paper uses the efficiency profit field method to achieve the dynamic selection of alliance partners or potential partners. The case study shows that the establishment of the high-speed railway dynamic logistics alliance can optimize the interests of the members of the alliance, verify the effectiveness of the method, and provide a reference for the better development of high-speed railway logistics.

show abstract

Section: Literature Reviewmentioning

confidence: 99%

Research on Partner Selection of High-Speed Railway Dynamic Logistics Alliance Based on the Dynamic Programming Model

Wang

Zhang

et al. 2023

Journal of Advanced Transportation

View full text Add to dashboard Cite

show abstract

“…A multiagent pursuit-evasion game requires that a set of agents persuade a different set of agents (evaders) to cooperate in order to be apprehended [16,17]. The playing eld, the players' game information and the players' ability to manage their movements Other crucial factors to take into account include the exibility of the evaders' manoeuvrability and the concept of a capture [18,19]. Another type of chase-evasion game involves the pursuit and evasion of players in a de ned setting, such as a grid map.…”

Section: Introductionmentioning

confidence: 99%

Cooperative control for multi-player pursuit-evasion games embedded on communication technology with reinforcement learning

kavin,

Kamala

et al. 2023

Preprint

View full text Add to dashboard Cite

Recent advances in research on the Multi-agent System (MAS) optimal control issue will help sectors like robotics, communications, and power systems. This work looks at the intelligent design of a large-scale multi-pursuer and multi-evader pursuit-evasion game. Based on reinforcement learning, a distributed cooperative pursuit method with communication is created. The famed Curse of Dimensionality poses a serious danger to multi-player pursuit-evasion game designs due to the sheer number of agents, especially in hostile areas where there aren't many communication options available to encourage player information exchange. In order to find the best pursuit-evasion strategies using a novel type of probability density function (PDF) rather than exhaustive data from all the remaining teams or agents, the Mean Field Games (MFG) theory has been used. A novel MAS optimum type oversight system with a decentralised and computer-friendly decision method is urgently needed. Mean field game theory is used to create the Actor-critic-mass (ACM), a decentralised optimal control system, to address the aforementioned issues. Additionally, the homogeneous decentralised Actor-critic-mass (HDACM) which improves the ACM method, does away with restrictions like homogeneous agents and cost functions. Finally, two applications make use of the PAS algorithm.

show abstract

Intelligent Escape of Robotic Systems: A Survey of Methodologies, Applications, and Challenges

Li,

Yang

2023

J Intell Robot Syst

View full text Add to dashboard Cite

Online Adaptive Dynamic Programming-Based Solution of Networked Multiple-Pursuer and Single-Evader Game

Cited by 6 publications

References 30 publications

Research on Partner Selection of High-Speed Railway Dynamic Logistics Alliance Based on the Dynamic Programming Model

Research on Partner Selection of High-Speed Railway Dynamic Logistics Alliance Based on the Dynamic Programming Model

Cooperative control for multi-player pursuit-evasion games embedded on communication technology with reinforcement learning

Intelligent Escape of Robotic Systems: A Survey of Methodologies, Applications, and Challenges

Contact Info

Product

Resources

About