2022
DOI: 10.48550/arxiv.2201.00548
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism

Abstract: The dynamic job-shop scheduling problem (DJSP) is a class of scheduling tasks that specifically consider the inherent uncertainties such as changing order requirements and possible machine breakdown in realistic smart manufacturing settings. Since traditional methods cannot dynamically generate effective scheduling strategies in face of the disturbance of environments, we formulate the DJSP as a Markov decision process (MDP) to be tackled by reinforcement learning (RL). For this purpose, we propose a flexible … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 43 publications
0
8
0
Order By: Relevance
“…Where the PPO method, derived from the work of Park [23] et al is one of the methods that achieved SOTA performance. The D3QPN method, derived from the work of Zeng [27] et al is one of the most applicable value-based RL methods for DJSP. Bellman means to replace the average reward calculation in TOFA with Bellman optimal equation.…”
Section: Results On Public Instancesmentioning
confidence: 99%
“…Where the PPO method, derived from the work of Park [23] et al is one of the methods that achieved SOTA performance. The D3QPN method, derived from the work of Zeng [27] et al is one of the most applicable value-based RL methods for DJSP. Bellman means to replace the average reward calculation in TOFA with Bellman optimal equation.…”
Section: Results On Public Instancesmentioning
confidence: 99%
“…However, the disjunctive graphs only reflect the static features in JSSP, failing to represent the dynamic features in DJSSP. Therefore, several following attributes are added to the vertices in the disjunctive graph to represent the dynamic features [36]:…”
Section: Mathematical Modelmentioning
confidence: 99%
“…Zhao et al proposed the deep Q-network (DQN) to improve the performance of the adaptive scheduling algorithm in dynamic smart manufacturing [35]. Wang et al [36] and Zeng et al [37] proposed the dual Q-learning (D-Q) method as the solution of the dynamic job-shop scheduling problem. Luo et al [38] proposed a two-hierarchy deep Q-network to deal with flexible job-shop scheduling problems with the disturbance of new jobs.…”
Section: Introductionmentioning
confidence: 99%