Reinforcement Learning for Connected Autonomous Vehicle Localization via UAVs

Testi, Enrico; Favarelli, Elia; Giorgetti, Andrea

doi:10.1109/metroagrifor50201.2020.9277630

Cited by 18 publications

(17 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We modify the environment in [50] to a decentralized RL setting where 𝑁 agents (UAVs) aim to work together to reach a specific target. Each agent can choose a set of four actions {north, south, west, east} as shown in Fig.…”

Section: B Decentralized Rlmentioning

confidence: 99%

See 1 more Smart Citation

Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT

Lei,

Ye,

Xiao

et al. 2021

Preprint

View full text Add to dashboard Cite

Section: B Decentralized Rlmentioning

confidence: 99%

“…We assume the scenario contains only light-of-sight components. The estimated position of agents can be obtained as in [50]. The reward function is defined as:…”

Section: B Decentralized Rlmentioning

confidence: 99%

Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT

Lei,

Ye,

Xiao

et al. 2021

Preprint

View full text Add to dashboard Cite

“…Indeed, time is a key aspect for UAV networks because of their limited energy autonomy [5][6][7] and, thus, it should be properly accounted for when designing the UAV control for time-critical applications (e.g., search-and-rescue). In [5], an information-seeking algorithm is developed for extraterrestrial exploration and return-to-base application, whereas in [8,9] a similar problem is solved using RL for source localization. Algorithms for UAVs formation, navigation and self-localization have been proposed in [10][11][12][13][14], and RL for enhancing communications has been studied in [15][16][17][18].…”

Section: Introductionmentioning

confidence: 99%

Real-Time Learning for THZ Radar Mapping and UAV Control

Guerra

Guidi

Dardari

et al. 2021

2021 IEEE International Conference on Autonomous Systems (ICAS)

View full text Add to dashboard Cite

In this paper we consider a joint detection, mapping and navigation problem by an unmanned aerial vehicle (UAV) with real-time learning capabilities. We formulate this problem as a Markov decision process (MDP), where the UAV is equipped with a THz radar capable to electronically scan the environment with high accuracy and to infer its probabilistic occupancy map. The navigation task amounts to maximizing the desired mapping accuracy and coverage and to decide whether targets (e.g., people carrying radio devices) are present or not. With the numerical results, we analyze the robustness of the considered Q-learning algorithm, and we discuss practical applications.

show abstract

“…For example, UAVs have played a central role in emergency situations in hazardous environments, for post natural disasters, or for search-and-rescue operations. In such events, UAVs have been used as a temporary network infrastructure for localization, communications, and for delivering items [1]- [3].…”

Section: Introductionmentioning

confidence: 99%

“…In this sense, machine learning (ML) can help in acquiring a knowledge of the model through experience. To that end, we adopt reinforcement learning (RL), which is based on the "trial-and-error" philosophy that allows to choose actions in order to maximize the sum of the discounted rewards over the future [3], [6]- [8]. In such settings, UAV navigation is driven by the balance between "exploration" and "exploitation".…”

Section: Introductionmentioning

confidence: 99%

Multi-Agent Q-Learning in UAV Networks for Target Detection and Indoor Mapping

Guerra

Guidi

Dardari

et al. 2021

2021 International Balkan Conference on Communications and Networking (BalkanCom)

View full text Add to dashboard Cite

We consider a network of unmanned aerial vehicles (UAVs) for a search-and-rescue operations involving both detection of multiple targets and mapping of environment, where the learning time is limited. One possibility for accomplishing the goal while guaranteeing short learning time is to employ cooperation among UAVs. With this objective, we adopt a multiagent Q-learning algorithm that allows the UAVs to learn a suitable navigation policy in real-time in order to complete a mission within a fixed time frame. The obtained results demonstrate that proper combination of the information gathered by the UAVs allows for an accelerated learning process.

show abstract

Reinforcement Learning for Connected Autonomous Vehicle Localization via UAVs

Cited by 18 publications

References 14 publications

Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT

Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT

Real-Time Learning for THZ Radar Mapping and UAV Control

Multi-Agent Q-Learning in UAV Networks for Target Detection and Indoor Mapping

Contact Info

Product

Resources

About