A Q-Learning-Based Approach for Simple and Multi-Agent Systems

Ulusoy, Ümit; Güzel, Mehmet Serdar; Bostancı, Erkan

doi:10.5772/intechopen.88484

Cited by 2 publications

(5 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Strategies are the most critical guidelines for how to act in a particular situation. Effective strategies to be implemented in the arena are of great importance for international tournaments that are organized annually for the RoboCode war simulator [13]. It should be noted that, while a robot can perform well in a one-on-one battle, other adaptive strategies may be required for close combat.…”

Section: Battlingmentioning

confidence: 99%

“…Q-learning is one of the leading off-policy RL algorithms, preferred in another recent study due to its efficiency and popularity. However, in this study, an artificial neural network is designed to approximate Q values instead of trying to keep them in a "Qtable," which is essentially not possible for such a continuous space problem [8]. e neural network has a very modest structure, involving only two layers.…”

Section: Battlingmentioning

confidence: 99%

“…Along with the first award estimated, the agent starts inserting the "q value" regarding the relation between the state and action into the Q-table. Subsequently, the agent starts to predict and reach the prize by maximizing the values of forward-looking moves at each iteration [8].…”

Section: Introductionmentioning

confidence: 99%

“…RoboCode is an open-source war simulator program developed by IBM using Java programming language in 2001 [12,13]. e purpose of this simulator is to program a war robot using the classes offered by the platform to the users in a two-dimensional environment and to measure the performance of the programmed agent by fighting these robots in the environment provided by this simulator which is called the arena.…”

Section: Introductionmentioning

confidence: 99%

“…In order for a robot to be a winner, it is not sufficient only to survive on all rounds because robots can earn more points than the winner due to some offensive actions. A list of the rules for the RoboCode platform can be seen in [8,9]. e flow diagram of the process cycle used by the RoboCode engine can also be seen in Figure 2.…”

mentioning

confidence: 99%

See 4 more Smart Citations

A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q‐Learning

et al. 2021

Self Cite

View full text Add to dashboard Cite

This paper addresses a new machine learning-based behavioral strategy using the deep Q-learning algorithm for the RoboCode simulation platform. According to this strategy, a new model is proposed for the RoboCode platform, providing an environment for simulated robots that can be programmed to battle against other robots. Compared to Atari Games, RoboCode has a fairly wide set of actions and situations. Due to the challenges of training a CNN model for such a continuous action space problem, the inputs obtained from the simulation environment were generated dynamically, and the proposed model was trained by using these inputs. The trained model battled against the predefined rival robots of the environment (standard robots) by cumulatively benefiting from the experience of these robots. The comparison between the proposed model and standard robots of RoboCode Platform was statistically verified. Finally, the performance of the proposed model was compared with machine learning based-customized robots (community robots). Experimental results reveal that the proposed model is mostly superior to community robots. Therefore, the deep Q-learning-based model has proven to be successful in such a complex simulation environment. It should also be noted that this new model facilitates simulation performance in adaptive and partially cluttered environments.

show abstract

Section: Battlingmentioning

confidence: 99%

Section: Battlingmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

mentioning

confidence: 99%

See 3 more Smart Citations