DRL‐Based Intelligent Resource Allocation for Diverse QoS in 5G and toward 6G Vehicular Networks: A Comprehensive Survey

Nguyen, Ha Q.; Nguyen, Minh Tuan; T., Hai; Hua, Hoang T.; Nguyen, Cuong V.

doi:10.1155/2021/5051328

Cited by 26 publications

(14 citation statements)

References 106 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…5 for allocating power to maximize the average sum rate for each UE have been more realistic. The simulations have been performed for evaluating the proposed TD3 algorithm with respect to two DRL-based algorithms: traditional DQN [23] and DDPG [29], as well as two traditional algorithms: WMMSE [22] and FP [21] which are the benchmarks in order to evaluate our proposed TD3 algorithm. In the simulation, we have considered 25 RRHs with 1 Km serving ranger per RRH and the number of total UE from 25 to 125 that are equally distributed among RRHs.…”

Section: Algorithm 1: Td3 Algorithmmentioning

confidence: 99%

“…Hence, the maximum travelled distance within the time slots is 14 m. As a result, UE association is considered with fixed RRH. The system parameters for simulations except the mobility model follow as [23,29] for ensuring the fair comparison, presented in Table 3.…”

Section: Algorithm 1: Td3 Algorithmmentioning

confidence: 99%

“…The internet of things (IoT), which emphasizes the goal and mission of a worldwide infrastructure connecting physical items known as things and uses internet protocol to allow them to optimization is done by convex optimization. However, most of the formulated problems, for example, dynamic PA, maximization of sum rate is strongly nonconvex as well as nondeterministic polynomial-time hardness (NP-hard) [23]. As a result, it isn't easy to get the optimized result [24].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Twin Delayed DDPG based Dynamic Power Allocation for Mobility in IoRT

Kabir

Tham

Chang

2023

JCOMSS

View full text Add to dashboard Cite

The internet of robotic things (IoRT) is a modern as well as fast-evolving technology employed in abundant socioeconomical aspects which connect user equipment (UE) for communication and data transfer among each other. For ensuring the quality of service (QoS) in IoRT applications, radio resources, for example, transmitting power allocation (PA), interference management, throughput maximization etc., should be efficiently employed and allocated among UE. Traditionally, resource allocation has been formulated using optimization problems, which are then solved using mathematical computer techniques. However, those optimization problems are generally nonconvex as well as nondeterministic polynomial-time hardness (NP-hard). In this paper, one of the most crucial challenges in radio resource management is the emitting power of an antenna called PA, considering that the interfering multiple access channel (IMAC) has been considered. In addition, UE has a natural movement behavior that directly impacts the channel condition between remote radio head (RRH) and UE. Additionally, we have considered two well-known UE mobility models i) random walk and ii) modified Gauss-Markov (GM). As a result, the simulation environment is more realistic and complex. A data-driven as well as model-free continuous action based deep reinforcement learning algorithm called twin delayed deep deterministic policy gradient (TD3) has been proposed that is the combination of policy gradient, actor-critics, as well as double deep Q-learning (DDQL). It optimizes the PA for i) stationary UE, ii) the UE movements according to random walk model, and ii) the UE movement based on the modified GM model. Simulation results show that the proposed TD3 method outperforms model-based techniques like weighted MMSE (WMMSE) and fractional programming (FP) as well as model-free algorithms, for example, deep Q network (DQN) and DDPG in terms of average sum-rate performance.

show abstract

Section: Algorithm 1: Td3 Algorithmmentioning

confidence: 99%

Section: Algorithm 1: Td3 Algorithmmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Twin Delayed DDPG based Dynamic Power Allocation for Mobility in IoRT

Kabir

Tham

Chang

2023

JCOMSS

View full text Add to dashboard Cite

show abstract

“…Since the power consumption of the MTC terminal is mainly in data transmission, this article considers the power consumption of the MTC terminal in data transmission (Nguyen et al, 2021;Wu, Li, & Jiang, 2022;Li L et al, 2020). Therefore, when the terminal transmits bit data and transmit power, the following power consumption formula is used to evaluate the power consumption:…”

Section: Energy Consumption Modelmentioning

confidence: 99%

Performance Art Video Action Management Oriented to 6G Wireless Transmission Technology

Wang¹,

Cho²

2023

JHASS

View full text Add to dashboard Cite

With the development of 5G communication networks, the development prospects, capability requirements and key technologies of the next-generation communication system 6G have become new hotspots. This article expands the coverage of the communication system through 6G, realizes the wireless transmission technology, builds the ideal of an intelligent mobile society everywhere, and analyzes the development trend of 6G technology, the challenges it faces, and the possible wireless transmission technology. Wireless transmission technology is widely used in various fields of production and life. Its various technical requirements and terminal energy limitations restrict the application and development of communications. This article focuses on the high-efficiency optimization scheme that combines 6G wireless transmission technology to improve the utilization efficiency of network resources. In addition, this thesis also discusses video action management in public performance art. Based on public performance art, the research on video action recognition can be summarized into three levels from simple to complex according to the content of the action, namely mobile vision, action vision and action vision. Video action vision is currently being studied. In the recognition process, most of the action frame information in the video is repetitive or has a low correlation with action recognition, which not only increases the complexity of calculation, but also affects the accuracy of action recognition. In the process of feature selection and understanding, the main methods include characterization fusion and research based on gesture features. Based on the introduction of 6G wireless transmission technology, this article introduces the technological development of 6G network, which promotes the rapid development of public performance art video action management.

show abstract

“…In [ 24 ], power allocation optimization is conducted by convex optimization. However, most of the formulated problems, for example, dynamic PA, maximization of the coverage area, traffic offloading, traffic load balancing with user association, maximization of sum rate, etc., are strongly nonconvex as well as nondeterministic polynomial-time hardness (NP-hard) [ 25 ]. In this research, we investigate optimizing the energy efficiency and throughput of UE as well as serving cell (SC) of the MDRU-aided two-tier HetNet scenario by ensuring the QoS of mobility-aware UEs where user association and power allocation for each UE have been considered without knowing the environmental priori knowledge.…”

Section: Introductionmentioning

confidence: 99%

Mobility-Aware Resource Allocation in IoRT Network for Post-Disaster Communications with Parameterized Reinforcement Learning

Kabir

Tham

Chang

et al. 2023

Sensors

View full text Add to dashboard Cite

Natural disasters, including earthquakes, floods, landslides, tsunamis, wildfires, and hurricanes, have become more common in recent years due to rapid climate change. For Post-Disaster Management (PDM), authorities deploy various types of user equipment (UE) for the search and rescue operation, for example, search and rescue robots, drones, medical robots, smartphones, etc., via the Internet of Robotic Things (IoRT) supported by cellular 4G/LTE/5G and beyond or other wireless technologies. For uninterrupted communication services, movable and deployable resource units (MDRUs) have been utilized where the base stations are damaged due to the disaster. In addition, power optimization of the networks by satisfying the quality of service (QoS) of each UE is a crucial challenge because of the electricity crisis after the disaster. In order to optimize the energy efficiency, UE throughput, and serving cell (SC) throughput by considering the stationary as well as movable UE without knowing the environmental priori knowledge in MDRUs aided two-tier heterogeneous networks (HetsNets) of IoRT, the optimization problem has been formulated based on emitting power allocation and user association combinedly in this article. This optimization problem is nonconvex and NP-hard where parameterized (discrete: user association and continuous: power allocation) action space is deployed. The new model-free hybrid action space-based algorithm called multi-pass deep Q network (MP-DQN) is developed to optimize this complex problem. Simulations results demonstrate that the proposed MP-DQN outperforms the parameterized deep Q network (P-DQN) approach, which is well known for solving parameterized action space, DQN, as well as traditional algorithms in terms of reward, average energy efficiency, UE throughput, and SC throughput for motionless as well as moveable UE.

show abstract

DRL‐Based Intelligent Resource Allocation for Diverse QoS in 5G and toward 6G Vehicular Networks: A Comprehensive Survey

Cited by 26 publications

References 106 publications

Twin Delayed DDPG based Dynamic Power Allocation for Mobility in IoRT

Twin Delayed DDPG based Dynamic Power Allocation for Mobility in IoRT

Performance Art Video Action Management Oriented to 6G Wireless Transmission Technology

Mobility-Aware Resource Allocation in IoRT Network for Post-Disaster Communications with Parameterized Reinforcement Learning

Contact Info

Product

Resources

About