2021
DOI: 10.1080/00207543.2021.1943037
|View full text |Cite
|
Sign up to set email alerts
|

Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
22
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 40 publications
(22 citation statements)
references
References 44 publications
0
22
0
Order By: Relevance
“…, 2020; Shi et al. , 2020; Yang and Xu, 2021). The A2C algorithm is a deep reinforcement learning method that combines two kinds of reinforcement learning algorithms (i.e.…”
Section: Science Mapping Of DL Applications In Manufacturing Operatio...mentioning
confidence: 99%
“…, 2020; Shi et al. , 2020; Yang and Xu, 2021). The A2C algorithm is a deep reinforcement learning method that combines two kinds of reinforcement learning algorithms (i.e.…”
Section: Science Mapping Of DL Applications In Manufacturing Operatio...mentioning
confidence: 99%
“…Furthermore, dynamic environments complicate the model and run-time limits the application of mathematical optimization to this NP-hard problem [34]. New methodologies such as deep reinforcement learning should be explored to study the real-time optimization and collaborative control of RMS in intelligent manufacturing [35].…”
mentioning
confidence: 99%
“…Despite very early attempts that couple machine learning with optimization methods [2][3][4], recent deep learning based methods present promising results on some COPs, which are comparable to classic optimization methods [5,6]. Their success has inspired miscellaneous deep models to cope with specific COPs, e.g., vehicle routing [7][8][9][10][11], scheduling [12][13][14][15], and enhance algorithmic paradigms for general COPs, e.g., integer programming (IP) [16][17][18][19], stochastic integer programming (SIP) [20][21][22][23][24]. All the above studies on the novel and active NCO domain greatly enrich the learning for optimization community.…”
Section: Motivationsmentioning
confidence: 99%
“…The remaining nodes are only for transition without demands. We set two types of commodities and their quantities are uniformly sampled from [5,15] in each scenario. For each edge, we uniformly sample the opening cost, shipping cost and capacity from [3,11], [5,11] and [10,41], respectively.…”
Section: Settingsmentioning
confidence: 99%
See 1 more Smart Citation