2021
DOI: 10.23919/jsee.2021.000079
|View full text |Cite
|
Sign up to set email alerts
|

A guidance method for coplanar orbital interception based on reinforcement learning

Abstract: This paper investigates the guidance method based on reinforcement learning (RL) for the coplanar orbital interception in a continuous low-thrust scenario. The problem is formulated into a Markov decision process (MDP) model, then a welldesigned RL algorithm, experience based deep deterministic policy gradient (EBDDPG), is proposed to solve it. By taking the advantage of prior information generated through the optimal control model, the proposed algorithm not only resolves the convergence problem of the common… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 9 publications
0
1
0
Order By: Relevance
“…, and , the final state and costate can be obtained through integrating (1) and (8). The final position vectors should satisfy (3), final position costate should satisfy (12), final velocity costate should satisfy (10), and final Hamiltonian should satisfy (11).…”
Section: Problem Formulationmentioning
confidence: 99%
See 2 more Smart Citations
“…, and , the final state and costate can be obtained through integrating (1) and (8). The final position vectors should satisfy (3), final position costate should satisfy (12), final velocity costate should satisfy (10), and final Hamiltonian should satisfy (11).…”
Section: Problem Formulationmentioning
confidence: 99%
“…Especially, Wu et al [7] proposed a method directly using DNNs to fast derive the optimal interception trajectories, learning optimal control law from datasets generated by traditional methods. George [8] presented a method in reinforcement learning manner, combining networks and evolutionary algorithm to solve optimal trajectories for interception and rendezvous scenarios with three different thrust configurations. Zeng et al [9] proposed a reinforcement learning method for coplanar orbital interception in long-distance scenarios, where an efficient DNN controller was trained to generate the optimal control sequence for spacecraft.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In recent years, deep learning (DL) and reinforcement learning (RL) have become hot topics in artificial intelligence technology, providing new design options for aircraft guidance and control systems [13][14][15]. The principle of RL originates from the process of intelligent species learning new things.…”
Section: Introductionmentioning
confidence: 99%