2021
DOI: 10.1109/jsac.2021.3087264
|View full text |Cite
|
Sign up to set email alerts
|

Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 57 publications
(12 citation statements)
references
References 38 publications
0
12
0
Order By: Relevance
“…Different from the existing DRL algorithms which consider either purely discrete [22], [30], [31] or purely continuous actions [12], [32], [33], we here study a more practical DRL setting with a hybrid discrete-continuous action for improving the training performance. Even though such a hybrid action setting has been previously mentioned in a few related works such as [34], a holistic investigation on the sampling of discrete and continuous actions has not been given. Therefore, we propose a new parameterized advantage actor critic (A2C) algorithm to optimize the system latency with actor and critic designs.…”
Section: Drl Design With Parameterized A2c For Bflmentioning
confidence: 99%
“…Different from the existing DRL algorithms which consider either purely discrete [22], [30], [31] or purely continuous actions [12], [32], [33], we here study a more practical DRL setting with a hybrid discrete-continuous action for improving the training performance. Even though such a hybrid action setting has been previously mentioned in a few related works such as [34], a holistic investigation on the sampling of discrete and continuous actions has not been given. Therefore, we propose a new parameterized advantage actor critic (A2C) algorithm to optimize the system latency with actor and critic designs.…”
Section: Drl Design With Parameterized A2c For Bflmentioning
confidence: 99%
“…. , R M } by (24); problem that M agents cannot be constructed simultaneously due to the limited of CPU resources and storage space, when M is large.…”
Section: Model Training and Parameter Migrationmentioning
confidence: 99%
“…Dalgkitsis et al [23] leveraged the deep deterministic policy gradient algorithm to implement dynamic resource-aware VNF placement. Akbari et al [24] studied a VNF placement and scheduling problem in an industrial internet-of-things network, and applied an actor-critic RL to jointly minimize the VNF cost and the age of information.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, dealing with age-optimal scheduling problem using reinforcement learning approaches in an unknown environment has recently drawn great attention [6], [20]- [28] and to the best of our knowledge, the first application of RL approaches to the problem with a minimum AoI criterion appeared in [6], which employed the average-cost SARSA with softmax algorithm to learn the system parameters and the transmission policy under hybrid ARQ (HARQ) protocols.…”
Section: Introductionmentioning
confidence: 99%
“…In [27], an underwater linear network was considered, in which the authors developed an actor-critic DRL based on a deep deterministic policy gradient (DDPG) method to minimize the normalized weighted sum AoI. In [28], the authors developed single-agent and cooperative multi-agent virtual network function (VNF) placement utilizing DRL method to minimize VNF placement cost, scheduling cost, and average AoI in industrial internet of things (IIoT). However, none of the multi-user system works consider NOMA transmission scheme when using RL to solve AoI minimization problems in the unknown environment.…”
Section: Introductionmentioning
confidence: 99%