2019 IEEE Global Communications Conference (GLOBECOM) 2019
DOI: 10.1109/globecom38437.2019.9014151
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Deep Reinforcement Learning for Cooperative Connected Vehicles

Abstract: Millimeter-wave (mmWave) base station can offer abundant high capacity channel resources toward connected vehicles so that quality-of-service (QoS) of them in terms of downlink throughput can be highly improved. The mmWave base station can operate among existing base stations (e.g., macro-cell base station) on non-overlapped channels among them and the vehicles can make decision what base station to associate, and what channel to utilize on heterogeneous networks. Furthermore, because of the non-omni property … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
6
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 25 publications
0
6
0
Order By: Relevance
“…In addition, we set the ratio of results by local computing and the OCD algorithm as an indicator of the reward. Furthermore, we set R capacity as a negative reward in order to have 20% of R combi in (6). According to this setting, the maximum value of R combi is 0.8.…”
Section: Simulation Results and Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition, we set the ratio of results by local computing and the OCD algorithm as an indicator of the reward. Furthermore, we set R capacity as a negative reward in order to have 20% of R combi in (6). According to this setting, the maximum value of R combi is 0.8.…”
Section: Simulation Results and Discussionmentioning
confidence: 99%
“…Among them, this paper considers reinforcement learning based approaches because this given problem is for stochastic sequential offloading decision-making. Among a lot of deep reinforcement learning (DRL) methodologies such as Q-learning, Markov decision process (MDP) [5], deep Q-network (DQN), and deep deterministic policy gradient (DDPG) [6,7], this paper designs a sequential offloading decision-making algorithm based on DQN. The reason why this paper considers DQN is that it is the function approximation of Q-learning using deep neural network (DNN) in order to take care of large-scale problem setting.…”
mentioning
confidence: 99%
“…Authors in [26] mitigate the inter-beam inter-cell interference through joint usercell association and selection of number of beams. Finally, MARL emerges as a promising approach to cope with traffic signal control [27] and resource allocation [28] in vehicular networks, user association [29] and handover management [30] in mmWave networks.…”
Section: Related Workmentioning
confidence: 99%
“…Among various DRL algorithms, deep Q-network (DQN) is one of the most successful early-stage initial frameworks [31][32][33]. The DRL algorithms are extended from single-agent to multi-agent for cooperative and coordinated computation, and this is called MADRL [34,35]. In MADRL, CommNet [13,26] and the abstraction mechanism based on two-stage attention network (G2ANet) [36] are famous.…”
Section: Related Workmentioning
confidence: 99%