2020
DOI: 10.1007/978-3-030-53552-0_19
|View full text |Cite
|
Sign up to set email alerts
|

Online Stacking Using RL with Positional and Tactical Features

Abstract: We study the scenario where some items are stored temporarily in stacks and where it is not allowed to put an item on top of another item leaving earlier. An arriving item is assigned to a stack based only on information on the arrival and departure times for the new item and items currently stored. The objective is to minimize the maximum number of stacks used over time. This problem is referred to as online stacking. We use Reinforcement Learning (RL) techniques to improve heuristics earlier presented in the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
1
1
1

Relationship

3
0

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…Simple heuristics for online stacking similar to Algorithm Online have been presented by Borgman et al [4], Duinkerken et al [8], Hamdi et al [9], and Wang et al [18] without providing a proof of asymptotic optimality. Olsen shows in [12] how Reinforcement Learning can be used to improve simple online stacking heuristics. Finally, we mention the work of Rei and Pedroso [16] and König et al [11] on related problems within the steel industry as well as the PhD thesis by Pacino [14] on container ship stowage.…”
Section: Related Workmentioning
confidence: 99%
“…Simple heuristics for online stacking similar to Algorithm Online have been presented by Borgman et al [4], Duinkerken et al [8], Hamdi et al [9], and Wang et al [18] without providing a proof of asymptotic optimality. Olsen shows in [12] how Reinforcement Learning can be used to improve simple online stacking heuristics. Finally, we mention the work of Rei and Pedroso [16] and König et al [11] on related problems within the steel industry as well as the PhD thesis by Pacino [14] on container ship stowage.…”
Section: Related Workmentioning
confidence: 99%
“…Olsen and Gross [14] have developed a polynomial time algorithm for online coloring with a competitive ratio that converges to 1 in probability if the endpoints of the storage time intervals are picked independently and uniformly at random. Olsen [13] has also shown how to use Reinforcement Learning to improve online stacking heuristics.…”
Section: Related Workmentioning
confidence: 99%