2020
DOI: 10.1109/tvt.2019.2961405
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications

Abstract: Device-to-device (D2D) communication underlay cellular networks is a promising technique to improve spectrum efficiency. In this situation, D2D transmission may cause severe interference to both the cellular and other D2D links, which imposes a great technical challenge to spectrum allocation. Existing centralized schemes require global information, which causes a large signaling overhead. While existing distributed schemes requires frequent information exchange among D2D users and cannot achieve global optimi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
53
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 123 publications
(53 citation statements)
references
References 39 publications
0
53
0
Order By: Relevance
“…Methods ranging from heuristic optimization [12] to machine learning [33], have been used under varying sets of users and their requirements. Machine learning, being the current hot topic in this domain; centralized vs. distributed reinforcement learning, multi-agent deep reinforcement learning [34], and other methods that could provide a faster convergence with better performance than traditional methods. Relay-assisted [35], out-of-coverage scenarios [36] and energy efficient resource allocation [37] is currently being researched upon.…”
Section: Discussion and Future Workmentioning
confidence: 99%
“…Methods ranging from heuristic optimization [12] to machine learning [33], have been used under varying sets of users and their requirements. Machine learning, being the current hot topic in this domain; centralized vs. distributed reinforcement learning, multi-agent deep reinforcement learning [34], and other methods that could provide a faster convergence with better performance than traditional methods. Relay-assisted [35], out-of-coverage scenarios [36] and energy efficient resource allocation [37] is currently being researched upon.…”
Section: Discussion and Future Workmentioning
confidence: 99%
“…A survey of different popular and AI-based interference mitigation and RA approaches developed in D2D communications is provided in [24]. Additionally, the multi-agent actor critic (MAAC) is a newly proposed algorithm in [25] to mitigate interference by efficiently distributing the spectrum allocation. Moreover, the same paper proposes the neighbor-agent actor critic (NAAC) that uses neighbor users' historical information for centralized training leading to outage probability reduction and sum rate improvement for D2D links.…”
Section: For Interference Mitigationmentioning
confidence: 99%
“…2) Deep learning based algorithms: Deep learning technology, which is based on deep neural network (DNN), has gained in popularity over the last decade due to its superior performance over the conventional techniques [34]- [40]. It is possible for us to solve complex non-linear problems in an efficient manner by using a back-propagation (BP) algorithm [41], in which a trained DNN model can be employed for reducing the computational time required in practical systems [37].…”
Section: B the Existing Workmentioning
confidence: 99%