2021
DOI: 10.1177/01423312211052742
|View full text |Cite
|
Sign up to set email alerts
|

Guidance law of interceptors against a high-speed maneuvering target based on deep Q-Network

Abstract: This paper proposes a novel guidance law for intercepting a high-speed maneuvering target based on deep reinforcement learning, which mainly includes the interceptor–target relative motion model and value function approximation model based on deep Q-Network (DQN) with prioritized experience replay. First, a method called prioritized experience replay is applied to extract more efficient samples and reduce the training time. Second, to cope with the discrete action space of DQN, a normal acceleration is introdu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 23 publications
0
2
0
Order By: Relevance
“…Near space high-speed vehicle technology is an important milestone in the history of modern weapons and equipment. As the commanding height of science and technology, it greatly enriches the content of attack-defense confrontation in near space [1][2][3][4]. Aiming at the optimal penetration guidance problem of high-speed vehicles against a modified proportional guidance interceptor, this paper obtains the analytical solution of an optimal penetration strategy by introducing Hamilton's principle [5][6][7].…”
Section: Introductionmentioning
confidence: 99%
“…Near space high-speed vehicle technology is an important milestone in the history of modern weapons and equipment. As the commanding height of science and technology, it greatly enriches the content of attack-defense confrontation in near space [1][2][3][4]. Aiming at the optimal penetration guidance problem of high-speed vehicles against a modified proportional guidance interceptor, this paper obtains the analytical solution of an optimal penetration strategy by introducing Hamilton's principle [5][6][7].…”
Section: Introductionmentioning
confidence: 99%
“…However, the orientation of the landing platform is not considered, and vertical velocity control is not included in the action set. A DRL-based guidance law is proposed in [15] to deal with maneuvering of high-speed targets. Based on the DQN algorithm, a relative-motion model is established and reward function is designed that can obtain continuous acceleration commands, make the LOS rate converge to zero rapidly, and hit the maneuvering target using only the LOS rate.…”
Section: Introductionmentioning
confidence: 99%