2020 4th International Conference on Advanced Systems and Emergent Technologies (IC_ASET) 2020
DOI: 10.1109/ic_aset49463.2020.9318258
|View full text |Cite
|
Sign up to set email alerts
|

A Policy Gradient Based Reinforcement Learning Method for Supply Chain Management

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 10 publications
0
4
0
Order By: Relevance
“…Similarly to BC, DRL has experienced exponential growth in the last decade [20,21], and there is currently strong interest in exploring the use of DRL to improve SC network performance (Table 1) [22][23][24][25][26]; however, [22] is the only work reported in the literature that introduces a distributed collaborative dynamic access control scheme utilizing DRL, and redefining network security architecture by combining anomaly detection, dynamic updates to user trust profiles, and collaborative adjustments for mitigation policies, to the best of authors knowledge. This scheme addresses the escalating challenge of insider threats in network security.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Similarly to BC, DRL has experienced exponential growth in the last decade [20,21], and there is currently strong interest in exploring the use of DRL to improve SC network performance (Table 1) [22][23][24][25][26]; however, [22] is the only work reported in the literature that introduces a distributed collaborative dynamic access control scheme utilizing DRL, and redefining network security architecture by combining anomaly detection, dynamic updates to user trust profiles, and collaborative adjustments for mitigation policies, to the best of authors knowledge. This scheme addresses the escalating challenge of insider threats in network security.…”
Section: Related Workmentioning
confidence: 99%
“…Traffic Allocation MADDPG based optimize traffic allocation policy for adaptive and automatic collaborative management, considering network security, network environment, and user requirements. 2020 [24] PPO Order Placement Development of a reinforcement learning agent for optimal order placement and inventory replenishment in SC management.…”
Section: Related Workmentioning
confidence: 99%
“…Intuition-based approaches are replaced by supply chain computerized solutions such as inventory management, warehousing, allocation, and replenishment. Hachaïchi et al (2020) aim at building a reinforcement learning agent capable of placing optimal orders for the sake of constructing a replenishment plan for next period. Current supply chain efficiency management methods cannot effectively control the risk caused by inefficient supply chain management.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Experiments show that using DQN for one node achieves better results than using the base-stock policy for all nodes. Hachaïchi et al (2020) use PPO and DDPG to solve an inventory replenishment problem in a two-echelon supply chain. There is one distribution center and three stores, with local capacitated stocks.…”
Section: Related Workmentioning
confidence: 99%