2021 IEEE 94th Vehicular Technology Conference (VTC2021-Fall) 2021
DOI: 10.1109/vtc2021-fall52928.2021.9625582
|View full text |Cite
|
Sign up to set email alerts
|

Distributed Deep Reinforcement Learning Resource Allocation Scheme For Industry 4.0 Device-To-Device Scenarios

Abstract: This paper proposes a distributed deep reinforcement learning (DRL) methodology for autonomous mobile robots (AMRs) to manage radio resources in an indoor factory with no network infrastructure. Hence, deep neural networks (DNN) are used to optimize the decision policy of the robots, which will make decisions in a distributed manner without signalling exchange. To speed up the learning phase, a centralized training is adopted in which a single DNN is trained using the experience from all robots. Once completed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…Samples for updating the DDQN weights are then drawn randomly from the buffer thereby eliminating correlations between successive samples. The agents are trained using the reward function in (11) with ζ target = 0 bps/Hz.…”
Section: B Ddqn Design and Training Proceduresmentioning
confidence: 99%
See 1 more Smart Citation
“…Samples for updating the DDQN weights are then drawn randomly from the buffer thereby eliminating correlations between successive samples. The agents are trained using the reward function in (11) with ζ target = 0 bps/Hz.…”
Section: B Ddqn Design and Training Proceduresmentioning
confidence: 99%
“…To overcome this limitation, algorithms for resource allocation have been traditionally based on hard-coded heuristics [5] or using optimization techniques such as game theory [6], genetic algorithm [7] and geometric programming [8]. Over the last few years, the focus appears to have shifted towards machine learning-based algorithms [4] resulting in a large number of published works applying supervised [9], unsupervised [10] and reinforcement learning techniques [11] for resource allocation in different types of wireless systems.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning (RL) is adopted to resolve the joint user association and channel assignment problem. The authors in [8] use RL in a factory setting for time-slot selection for packet transmission. However, in [6]- [8], the authors do not take into consideration power allocation, which is certainly beneficial to manage the interference.…”
Section: Introductionmentioning
confidence: 99%
“…To overcome these limitations, we conjecture that reinforcement learning (RL) methods [10], [11] can be developed to perform resource selection, with potential performance improvement over existing approaches even with only quantized information. Moreover, a RL based method will eliminate the offline data generation requirement for the method in [9].…”
Section: Introductionmentioning
confidence: 99%