Joint Deep Reinforcement Learning and Unsupervised Learning for Channel Selection and Power Control in D2D Networks

Sun, Ming; Jin, Yanhui; Wang, Shumei; Mei, Erzhuang

doi:10.3390/e24121722

Cited by 5 publications

(2 citation statements)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…It is well known that the deep reinforcement learning has excellent abilities in environment interaction and dynamic perception. In the dynamic wireless network environments, the agent can learn through interaction with the environments [11], [12]. Currently, the deep Q network (DQN), as a research hotspot in deep reinforcement learning, has been widely used for the resource allocation in wireless communications.…”

Section: Introductionmentioning

confidence: 99%

Joint DDPG and Unsupervised Learning for Channel Allocation and Power Control in Centralized Wireless Cellular Networks

et al. 2023

View full text Add to dashboard Cite

In order to solve the resource allocation problem in scenarios of centralized wireless cellular communication with multiple cells, users and channels, a novel resource allocation algorithm based on joint Deep Deterministic Policy Gradient (DDPG) reinforcement learning and unsupervised learning is proposed. Firstly, the proposed algorithm builds a channel allocation deep neural network based on DDPG to provide an optimized channel allocation scheme. Secondly, the proposed algorithm constructs a power control deep neural network based on unsupervised learning to provide an optimized power control scheme. In order to make the unsupervised learning have perceptions on dynamic wireless environments, the experience replay is executed twice to train the channel allocation deep neural network with the DDPG reinforcement learning and the power control deep neural network with the unsupervised learning, respectively. Because the proposed joint algorithm combines the dynamic perception ability of the DDPG reinforcement learning and the continuous optimization ability of unsupervised learning, the energy efficiency can be maximized effectively. Simulation results show that the proposed algorithm outperforms other algorithms in terms of energy efficiency and transmit rate in time-varying dynamic environments.INDEX TERMS Deep reinforcement learning, unsupervised learning, channel allocation, power control, wireless cellular networks.

show abstract

Section: Introductionmentioning

confidence: 99%

Joint DDPG and Unsupervised Learning for Channel Allocation and Power Control in Centralized Wireless Cellular Networks

et al. 2023

View full text Add to dashboard Cite

show abstract

“…According to the statistics, it is estimated that, in 2040, the number of intelligent terminal connections will increase by more than 30 times compared with 2022, and the average monthly traffic will increase by more than 130 times [5]. Finally, in the 6G era, there will be an Internet market of "hundreds the whole wireless communication system and reduces the computing pressure of the base station to a certain extent, while improving the spectrum resource utilization and throughput of the system [11].…”

Section: Introductionmentioning

confidence: 99%

D2D Communication Network Interference Coordination Scheme Based on Improved Stackelberg

Chen

et al. 2023

Sustainability

View full text Add to dashboard Cite

The sudden explosive growth of data in intelligent devices and existing communication networks has brought great challenges to existing communication networks. On the one hand, D2D (device to device) technology greatly improves the utilization of spectrum resources; on the other hand, it improves the communication quality of users. It has become an important part of the future communication network. Aiming at the problem that the existing D2D communication network system has complex user interference, and the communication quality of cellular users is difficult to guarantee, a D2D communication network interference coordination scheme based on improved Stackelberg is proposed. Using resource allocation and power control to solve the interference coordination problem, this paper proposes an improved Stackelberg model based on DQN (deep Q network), establishes the master–slave game between cellular users and multiplexing resource users (D2D users; relay communication users), optimizes the cost parameters in the Stackelberg mode and improves the transmission power and resource allocation scheme of multiplexing resource users. The simulation results show that compared with similar algorithms, the algorithm proposed in this paper has the best performance in guaranteeing the QoS of cellular users in the system and has good interference management capability for D2D communication networks.

show abstract

Variable Hybrid Action Space Deep Q-Networks for Optimal Power Allocation and User Association in Heterogeneous Networks

Valasa,

Lokam,

Bhar

2024

Wireless Pers Commun

View full text Add to dashboard Cite

Joint Deep Reinforcement Learning and Unsupervised Learning for Channel Selection and Power Control in D2D Networks

Cited by 5 publications

References 31 publications

Joint DDPG and Unsupervised Learning for Channel Allocation and Power Control in Centralized Wireless Cellular Networks

Joint DDPG and Unsupervised Learning for Channel Allocation and Power Control in Centralized Wireless Cellular Networks

D2D Communication Network Interference Coordination Scheme Based on Improved Stackelberg

Variable Hybrid Action Space Deep Q-Networks for Optimal Power Allocation and User Association in Heterogeneous Networks

Contact Info

Product

Resources

About