2021
DOI: 10.1016/j.jmsy.2021.02.014
|View full text |Cite
|
Sign up to set email alerts
|

A study on a Q-Learning algorithm application to a manufacturing assembly problem

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 25 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…Many authors explored the deep Q learning algorithm and exploited it in a wide variety of complex applications, like collaborative business processes with cloud services, manufacturing assembly programs, many robotic applications, automated trading in equity stock markets, and many more [24][25][26][27][28]. Chatterjee et al [29] discussed deep reinforcement learning for the application where the most phishing activities are taking place on websites and detecting malicious URLs.…”
Section: Review Of Related Workmentioning
confidence: 99%
“…Many authors explored the deep Q learning algorithm and exploited it in a wide variety of complex applications, like collaborative business processes with cloud services, manufacturing assembly programs, many robotic applications, automated trading in equity stock markets, and many more [24][25][26][27][28]. Chatterjee et al [29] discussed deep reinforcement learning for the application where the most phishing activities are taking place on websites and detecting malicious URLs.…”
Section: Review Of Related Workmentioning
confidence: 99%
“…The studied ASP problem was based on the assembly case study proposed by Neves et al [24] of an airplane toy from the Yale-CMU-Berkeley Object and Benchmark Dataset [25,26], Fig 2. This assembly was optimized through the usage of the deep reinforcement learning algorithms A2C [21], DQN [1], and Rainbow [23], provided in the RLlib python library [27], and the tabular Q-Learning algorithm [28].…”
Section: Case Studymentioning
confidence: 99%
“…The MDP formulation of the airplane assembly sequence planning problem was based on the approach proposed by Neves et al [24].…”
Section: Mdp Formulationmentioning
confidence: 99%
See 1 more Smart Citation
“…As one of most commonly used RL algorithms, the Q-Learning algorithm, which is based on value, reference strategy learning and TD method [7], has been widely applied to route planning, manufacturing and assembly, and dynamic train scheduling [8][9][10]. Many researchers have been dedicated to improving the low exploration efficiency problem of the traditional Q-Learning algorithm.…”
Section: Introductionmentioning
confidence: 99%