2019
DOI: 10.1109/access.2019.2933002
|View full text |Cite
|
Sign up to set email alerts
|

UAV Autonomous Target Search Based on Deep Reinforcement Learning in Complex Disaster Scene

Abstract: In recent years, artificial intelligence has played an increasingly important role in the field of automated control of drones. After AlphaGo used Intensive Learning to defeat the World Go Championship, intensive learning gained widespread attention. However, most of the existing reinforcement learning is applied in games with only two or three moving directions. This paper proves that deep reinforcement learning can be successfully applied to an ancient puzzle game Nokia Snake after further processing. A game… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
59
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 108 publications
(59 citation statements)
references
References 23 publications
0
59
0
Order By: Relevance
“…P π(s) s,st+1 denotes the transition probability unknown in reality, and π(s) is the action generated under a specific policy. Based on (17) and (18), the cost functions (11), (12) can be rewritten as:…”
Section: A Bellman Equationmentioning
confidence: 99%
See 3 more Smart Citations
“…P π(s) s,st+1 denotes the transition probability unknown in reality, and π(s) is the action generated under a specific policy. Based on (17) and (18), the cost functions (11), (12) can be rewritten as:…”
Section: A Bellman Equationmentioning
confidence: 99%
“…In order to achieve the ability to learn automatically, we design the updating steps as following [35]: 1) evaluating result: obtain V i,π (s) and V total,π (q) according to (17) and (18) based on the policy π for all status.…”
Section: A Bellman Equationmentioning
confidence: 99%
See 2 more Smart Citations
“…The ant colony algorithm [38][39][40][41][42][43] is used to solve the order of shipments in each cluster and to dispatch the shortest path.…”
Section: Ant Colony Algorithm For Tsp Problemmentioning
confidence: 99%