2022
DOI: 10.1287/msom.2021.1064
|View full text |Cite
|
Sign up to set email alerts
|

Can Deep Reinforcement Learning Improve Inventory Management? Performance on Lost Sales, Dual-Sourcing, and Multi-Echelon Problems

Abstract: Problem definition: Is deep reinforcement learning (DRL) effective at solving inventory problems? Academic/practical relevance: Given that DRL has successfully been applied in computer games and robotics, supply chain researchers and companies are interested in its potential in inventory management. We provide a rigorous performance evaluation of DRL in three classic and intractable inventory problems: lost sales, dual sourcing, and multi-echelon inventory management. Methodology: We model each inventory probl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 84 publications
(33 citation statements)
references
References 51 publications
0
33
0
Order By: Relevance
“…To the best of our knowledge, ours is the first approach that shows how neural networks can compete with and even outperform inventory-management approaches developed after years of specialized research on dual sourcing. From a methodological perspective, our paper is close to Gijsbrechts et al (2020), who apply RL-based policy learning to a variety of inventory management problems. In particular, the authors show that the presented Asynchronous Advantage Actor-Critic (A3C) algorithm competes favorably with several known heuristics for lost-sales, dual-sourcing and multi-echelon models.…”
Section: Inventory Management With Dual Sourcingmentioning
confidence: 99%
See 2 more Smart Citations
“…To the best of our knowledge, ours is the first approach that shows how neural networks can compete with and even outperform inventory-management approaches developed after years of specialized research on dual sourcing. From a methodological perspective, our paper is close to Gijsbrechts et al (2020), who apply RL-based policy learning to a variety of inventory management problems. In particular, the authors show that the presented Asynchronous Advantage Actor-Critic (A3C) algorithm competes favorably with several known heuristics for lost-sales, dual-sourcing and multi-echelon models.…”
Section: Inventory Management With Dual Sourcingmentioning
confidence: 99%
“…Theoretically, neural networks are able to represent such policies under mild conditions, as specified by universal approximation theorems (Hornik 1991, Hanin and Sellke 2017, Park et al 2020. In practice, however, designing neural networks that represent optimal policies of discrete-time stochastic dynamical systems has been challenging (Gijsbrechts et al 2020, Boute et al 2021b. We next outline an algorithm that does so efficiently.…”
Section: Solving Discrete Stochastic Optimization Problems With Neura...mentioning
confidence: 99%
See 1 more Smart Citation
“…We now present related works that handle the problem using an order-based approach. Gijsbrechts et al (2019) propose a proof of concept by using Deep-RL on three different problems: dual-sourcing or dual-mode, lost sales, and multi-echelon inventory models. In the multi-echelon setup, demands are uncertain and regular, and lead times are determin-istic.…”
Section: Related Workmentioning
confidence: 99%
“…Some works in the literature use Deep RL on related problems, but they usually deal with smaller supply chain networks, with two-echelon or serial supply chains (Oroojlooyjadid 2019;Kemmer et al 2018;Hutse 2019;Gijsbrechts et al 2019;Peng et al 2019). Considering serial supply chains, the dimensionality of the action space is close to the number of echelons.…”
Section: Introductionmentioning
confidence: 99%