Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications

Li, Zheng; Guo, Caili

doi:10.1109/tvt.2019.2961405

Cited by 123 publications

(53 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Methods ranging from heuristic optimization [12] to machine learning [33], have been used under varying sets of users and their requirements. Machine learning, being the current hot topic in this domain; centralized vs. distributed reinforcement learning, multi-agent deep reinforcement learning [34], and other methods that could provide a faster convergence with better performance than traditional methods. Relay-assisted [35], out-of-coverage scenarios [36] and energy efficient resource allocation [37] is currently being researched upon.…”

Section: Discussion and Future Workmentioning

confidence: 99%

Distance-Based Resource Allocation for Vehicle-to-Pedestrian Safety Communication

Khan

Lee

2020

Electronics

View full text Add to dashboard Cite

Cellular Vehicle to Everything (V2X) has redefined the vehicular communication architecture as something that needs an ultra-reliable link, high capacity, and fast message delivery in vehicular networks. The V2X scenarios are broadly categorized as Vehicle to Vehicle (V2V), Vehicle to Infrastructure (V2I), Vehicle to Pedestrians (V2P), and Vehicle to Network (V2N). Vulnerable pedestrians belong to the V2P category and hence require an ultra-reliable link and a fast message delivery in case the moving vehicle is in the close proximity of the pedestrian. However, congestion in the network calls for an optimized resource allocation that would allow a fast and secure connection between a vehicle and the pedestrian. In this paper, we have proposed a distance-based resource allocation that classifies the pedestrians in different categories, performs a one-to-many weighted bipartite matching, and finally a reinforcement learning based power allocation.

show abstract

Section: Discussion and Future Workmentioning

confidence: 99%

Distance-Based Resource Allocation for Vehicle-to-Pedestrian Safety Communication

Khan

Lee

2020

Electronics

View full text Add to dashboard Cite

show abstract

“…A survey of different popular and AI-based interference mitigation and RA approaches developed in D2D communications is provided in [24]. Additionally, the multi-agent actor critic (MAAC) is a newly proposed algorithm in [25] to mitigate interference by efficiently distributing the spectrum allocation. Moreover, the same paper proposes the neighbor-agent actor critic (NAAC) that uses neighbor users' historical information for centralized training leading to outage probability reduction and sum rate improvement for D2D links.…”

Section: For Interference Mitigationmentioning

confidence: 99%

Leveraging Machine-Learning for D2D Communications in 5G/Beyond 5G Networks

Hashima¹,

Elhalwany²,

Hatano³

et al. 2021

Preprint

View full text Add to dashboard Cite

Device-to-device (D2D) communication is a promising paradigm for the fifth generation 2 (5G) and beyond 5G (B5G) networks. Although D2D communication provides several benefits, 3 including limited interference, energy efficiency, reduced delay, and network overhead, it faces a lot 4 of technical challenges such as network architecture, and neighbor discovery, etc. The complexity 5 of configuring D2D links and managing their interference, especially when using millimeter-wave 6 (mmWave), inspire researchers to leverage different machine-learning (ML) techniques to address 7 these problems towards boosting the performance of D2D networks. In this paper, a comprehensive 8 survey about recent research activities on D2D networks will be explored with putting more 9 emphasis on utilizing mmWave and ML methods. After exploring existing D2D research directions 10 accompanied with their existing conventional solutions, we will show how different ML techniques 11 can be applied to enhance the D2D networks performance over using conventional ways. Then, still 12 open research directions in ML applications on D2D networks will be investigated including their 13 essential needs. A case study of applying multi-armed bandit (MAB) as an efficient online ML tool 14 to enhance the performance of neighbor discovery and selection (NDS) in mmWave D2D networks 15 will be presented. This case study will put emphasis on the high potency of using ML solutions 16 over using the conventional non-ML based methods for highly improving the average throughput 17 performance of mmWave NDS.

show abstract

“…2) Deep learning based algorithms: Deep learning technology, which is based on deep neural network (DNN), has gained in popularity over the last decade due to its superior performance over the conventional techniques [34]- [40]. It is possible for us to solve complex non-linear problems in an efficient manner by using a back-propagation (BP) algorithm [41], in which a trained DNN model can be employed for reducing the computational time required in practical systems [37].…”

Section: B the Existing Workmentioning

confidence: 99%

Deep Learning Based Power Allocation for Workload Driven Full-Duplex D2D-Aided Underlaying Networks

Zhang

Wang

et al. 2020

IEEE Trans. Veh. Technol.

View full text Add to dashboard Cite

Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications

Cited by 123 publications

References 39 publications

Distance-Based Resource Allocation for Vehicle-to-Pedestrian Safety Communication

Distance-Based Resource Allocation for Vehicle-to-Pedestrian Safety Communication

Leveraging Machine-Learning for D2D Communications in 5G/Beyond 5G Networks

Deep Learning Based Power Allocation for Workload Driven Full-Duplex D2D-Aided Underlaying Networks

Contact Info

Product

Resources

About