Deep Reinforcement Learning for Radio Resource Allocation and Management in Next Generation Heterogeneous Wireless Networks: A Survey

Alwarafy, Abdulmalik; Abdallah, Mohamed; Çiftler, Bekir Sait; Al-Fuqaha, Ala; Hamdi, Mounir

doi:10.36227/techrxiv.14672643

Cited by 12 publications

(22 citation statements)

References 118 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…According to the last numerical comparisons of the RL and DRL algorithm complexities [81], the worst‐case computational complexity of the approaches, using RL used algorithms as Q‐learning, SARSA, Actor–Critic and Monte Carlo, is O(|S|*|A|), where S and A are the sizes of the space of states and actions, respectively, whereas the worst‐case complexity of DDQN, DQN, and duelling DQN‐based approaches is O(|S| 2 ). They show a dependence on the size of spaces of states.…”

Section: Simulation Results and Discussionmentioning

confidence: 99%

“…To deal with scalability issues faced by RL‐based approaches, the approaches in Ref. [55–58, 63, 65–68] have employed DRL‐based approaches based on DQN with one or two NNs, DDQN, Duelling DQN, and LSTM‐based A2C to allocate resources to RAN and (fog) edge network slices [81].…”

Section: Literature Reviewmentioning

confidence: 99%

“…In Ref. [80], the authors propose a single‐agent DQN algorithm based on Monte Carlo Tree Search to address the problem of dynamic spectrum sharing between 4G LTE and 5G NR (New Radio) systems [81]. So far, all the summarised approaches done on the recent network Frequency assignment (FA) employed its own discrete state space with a certain number and depending on the network configuration, whereas the reward function is a continuous function defined in terms of the QoS requirement, and channels' supplies, and others, such as if the reward is online traffic real reward or not and if it is online traffic prediction reward or not.…”

Section: Literature Reviewmentioning

confidence: 99%

“…Related studies can be found in the following sources but not restricted to Ref. [7–87]. Each of the proposed techniques has its own advantages and disadvantages, and none of them can be considered on its own to fully accomplish the realisation of CR systems.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Distributed reinforcement learning for dynamic spectrum allocation in cognitive radio‐based internet of things

Elhachmi

2022

IET Networks

View full text Add to dashboard Cite

Cognitive Radio (CR) with other advancements such as the Internet of things and machine learning has recently emerged as the main involved technique to use spectrum in an efficient manner. It can access the spectrum in a fully dynamic way and exploit the unused spectrum resources without creating any harm to cognitive users. In this paper, the authors develop a CR access strategy founded on the implementation of an efficient Deep Multi‐user Reinforcement Learning algorithm based on a combination of a Deep neural network, Q‐learning, and cooperative multi‐agent systems. The proposed approach consists of two stages: the user choice algorithm to set up an agent's activation order, and the frequency choice method to select the optimal channel on the appropriate bandwidth. Reasonable implementation is proposed, and the obtained results demonstrate that the authors’ approach can improve wireless communication for all CR terminals. It shows satisfactory performances in terms of user satisfaction degree and the number of used channels and can keep the channel allocation plan always in the appropriate state.

show abstract

Section: Simulation Results and Discussionmentioning

confidence: 99%

Section: Literature Reviewmentioning

confidence: 99%

Section: Literature Reviewmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Distributed reinforcement learning for dynamic spectrum allocation in cognitive radio‐based internet of things

Elhachmi

2022

IET Networks

View full text Add to dashboard Cite

show abstract

“…• Mathematical optimization or ML approaches? Next-generation wireless networks are more complicated due to their large-scale, versatile, and heterogeneous nature [98], [218]. Conventional mathematical optimization approach requires complete or quasi-complete knowledge of the wireless environment, which is non-trivial to obtain in heterogeneous scenarios.…”

Section: Discussion and Outlookmentioning

confidence: 99%

Evolution of NOMA Toward Next Generation Multiple Access (NGMA) for 6G

Liu,

Zhang,

et al. 2021

Preprint

View full text Add to dashboard Cite

Due to the explosive growth in the number of wireless devices and diverse wireless services, such as virtual/augmented reality and Internet-of-Everything, next generation wireless networks face unprecedented challenges caused by heterogeneous data traffic, massive connectivity, and ultra-high bandwidth efficiency and ultra-low latency requirements. To address these challenges, advanced multiple access schemes are expected to be developed, namely next generation multiple access (NGMA), which are capable of supporting massive numbers of users in a more resource-and complexity-efficient manner than existing multiple access schemes. As the research on NGMA is in a very early stage, in this paper, we explore the evolution of NGMA with a particular focus on non-orthogonal multiple access (NOMA), i.e., the transition from NOMA to NGMA. In particular, we first review the fundamental capacity limits of NOMA, elaborate the new requirements for NGMA, and discuss several possible candidate techniques. Moreover, given the high compatibility and flexibility of NOMA, we provide an overview of current research efforts on multi-antenna techniques for NOMA, promising future application scenarios of NOMA, and the interplay between NOMA and other emerging physical layer techniques. Furthermore, we discuss advanced mathematical tools for facilitating the design of NOMA communication systems, including conventional optimization approaches and new machine learning techniques. Next, we propose

show abstract

Slice admission control in 5G cloud radio access network using deep reinforcement learning: A survey

Khani,

Jamali,

Sohrabi

et al. 2024

Int J Communication

View full text Add to dashboard Cite

SummaryThe emergence of 5G networks has increased the demand for network resources, making efficient resource management crucial. Slice admission control (SAC) is a process that governs the creation and allocation of virtualized network environments, known as “network slices,” which can be tailored to meet specific user requirements. However, traditional SAC methods face dynamic and heterogeneous challenges in wireless networks, especially in cloud radio access networks (C‐RANs). To address this issue, machine learning (ML) techniques, particularly deep reinforcement learning (DRL), have been proposed as powerful tools for optimizing SAC. DRL‐based approaches enable SAC systems to learn from previous interactions with the network environment and dynamically adapt to changing network conditions. This review article comprehensively explains the current state‐of‐the‐art DRL‐based SAC, focusing on C‐RANs. The article identifies key challenges and future research directions and highlights the potential benefits of using DRL for SAC, including improved network performance and efficiency. However, deploying these systems in real‐world scenarios presents several challenges and trade‐offs that need to be carefully considered. Further research and development are required to address these challenges and ensure the successful deployment of DRL‐based SAC systems in wireless networks.

show abstract

Deep Reinforcement Learning for Radio Resource Allocation and Management in Next Generation Heterogeneous Wireless Networks: A Survey

Cited by 12 publications

References 118 publications

Distributed reinforcement learning for dynamic spectrum allocation in cognitive radio‐based internet of things

Distributed reinforcement learning for dynamic spectrum allocation in cognitive radio‐based internet of things

Evolution of NOMA Toward Next Generation Multiple Access (NGMA) for 6G

Slice admission control in 5G cloud radio access network using deep reinforcement learning: A survey

Contact Info

Product

Resources

About