Device-to-device (D2D) communication is a promising feature of 5G networks, offering benefits such as higher data rates, reduced cost and latency, and improved energy efficiency (EE). This study analyzes the operation of millimeter-wave (mmWave) D2D-enabled cellular networks, in which a user's device can connect either to a base station or directly to another device, enabling D2D communication subject to a distance threshold and accounting for interference. A deep reinforcement learning (DRL)-based resource allocation (RA) scheme is proposed for D2D-enabled mmWave communications underlaying cellular networks, and its performance is evaluated in terms of coverage probability, area spectral efficiency, and network EE. In noise-limited networks, the proposed strategy achieves the highest coverage probability. To account for the stochastic nature of wireless channels, the paper further proposes a firefly-algorithm-based optimization approach for RA, for which an asynchronous advantage actor-critic (A3C) DRL algorithm is employed. The proposed scheme is compared with two existing DRL algorithms, soft actor-critic and proximal policy optimization, and the numerical results show that the proposed firefly-algorithm-optimized A3C method outperforms these benchmark algorithms.
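To make the firefly-based optimization step concrete, the following is a minimal sketch of the standard firefly update rule, assuming it is used to tune A3C training parameters (e.g., a learning rate and an entropy weight); the surrogate objective, parameter names, and bounds below are illustrative stand-ins, not the paper's actual evaluation function, which would instead score a candidate configuration by the resulting D2D resource-allocation performance (e.g., negative network EE).

```python
import numpy as np

def firefly_optimize(objective, dim, n_fireflies=20, n_iter=100,
                     alpha=0.2, beta0=1.0, gamma=1.0, bounds=(0.0, 1.0), seed=0):
    """Minimal firefly algorithm: fireflies move toward brighter (lower-cost) ones."""
    rng = np.random.default_rng(seed)
    lo, hi = bounds
    pos = rng.uniform(lo, hi, size=(n_fireflies, dim))
    fitness = np.array([objective(p) for p in pos])
    for _ in range(n_iter):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if fitness[j] < fitness[i]:  # firefly j is brighter than i
                    r2 = np.sum((pos[i] - pos[j]) ** 2)
                    beta = beta0 * np.exp(-gamma * r2)  # attractiveness decays with distance
                    # move toward the brighter firefly plus a small random perturbation
                    pos[i] += beta * (pos[j] - pos[i]) + alpha * (rng.random(dim) - 0.5)
                    pos[i] = np.clip(pos[i], lo, hi)
                    fitness[i] = objective(pos[i])
    best = int(np.argmin(fitness))
    return pos[best], fitness[best]

if __name__ == "__main__":
    # Hypothetical surrogate cost over two A3C hyperparameters (not the paper's objective).
    surrogate = lambda x: (x[0] - 0.3) ** 2 + (x[1] - 0.7) ** 2
    best_params, best_cost = firefly_optimize(surrogate, dim=2)
    print("best hyperparameters:", best_params, "cost:", best_cost)
```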