2018
DOI: 10.3390/electronics7110279
|View full text |Cite
|
Sign up to set email alerts
|

Research on Air Confrontation Maneuver Decision-Making Method Based on Reinforcement Learning

Abstract: With the development of information technology, the degree of intelligence in air combat is increasing, and the demand for automated intelligent decision-making systems is becoming more intense. Based on the characteristics of over-the-horizon air combat, this paper constructs a super-horizon air combat training environment, which includes aircraft model modeling, air combat scene design, enemy aircraft strategy design, and reward and punishment signal design. In order to improve the efficiency of the reinforc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
33
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 59 publications
(33 citation statements)
references
References 11 publications
0
33
0
Order By: Relevance
“…The third scenario is red agent fleeing. We compare our method with AC [28] and DDPG [29] in the three scenarios. During training, three methods use same environment models and reward models proposed in the paper.…”
Section: A Effectiveness Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…The third scenario is red agent fleeing. We compare our method with AC [28] and DDPG [29] in the three scenarios. During training, three methods use same environment models and reward models proposed in the paper.…”
Section: A Effectiveness Comparisonmentioning
confidence: 99%
“…The method of offline learning and online decisionmaking can satisfy both the time limit and the optimal solution requirement. The application of reinforcement learning in air combat is mainly based on the value function search [25][26][27] and policy search [28,29]. Literature [25] applies the Q-learning method to air combat, but does not perform separate learning for the red and blue agents.…”
Section: Introductionmentioning
confidence: 99%
“…The demand for autonomous decision-making algorithms to support automated air confrontation systems is growing. The work by Zhang et al [12] addresses such demand by presenting the development of a super-horizon air confrontation training environment. The authors employ computational intelligence approaches, including reinforcement learning and neural networks, to create a self-learning air confrontation maneouver decision making system, which is tested by means of complex simulations of different air confrontation situations.…”
Section: The Present Special Issuementioning
confidence: 99%
“…References [15,16] use the Bayesian theory to select the optimal discrete maneuver for maneuvering decision-making; it requires high data accuracy. References [17,18] use a deep reinforcement learning technique to deal with maneuvering decision-making problems, with low realtime performance.…”
Section: Introductionmentioning
confidence: 99%