2022
DOI: 10.1016/j.knosys.2022.108765
|View full text |Cite
|
Sign up to set email alerts
|

Average reward adjusted deep reinforcement learning for order release planning in manufacturing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(1 citation statement)
references
References 49 publications
0
1
0
Order By: Relevance
“…On the basis of these findings, Mahesh, (2018) proposes that in reinforcement learning, a software agent determines the ideal behaviour in a specific context for a particular problem. This view is supported by Schneckenreither, Haeussler and Peiró, (2022) who writes that the agent takes the input and decides the best action for the problem and then based on the result of the action the agent then receives simple reward feedback to allow it to learn from its behaviour. In unsupervised learning the outputs (labels) aren't known, as the models find patterns and structure from the data without any assistance (Greener et al, 2022).…”
Section: Literature Reviewmentioning
confidence: 99%
“…On the basis of these findings, Mahesh, (2018) proposes that in reinforcement learning, a software agent determines the ideal behaviour in a specific context for a particular problem. This view is supported by Schneckenreither, Haeussler and Peiró, (2022) who writes that the agent takes the input and decides the best action for the problem and then based on the result of the action the agent then receives simple reward feedback to allow it to learn from its behaviour. In unsupervised learning the outputs (labels) aren't known, as the models find patterns and structure from the data without any assistance (Greener et al, 2022).…”
Section: Literature Reviewmentioning
confidence: 99%