Abstract:We study the scenario where some items are stored temporarily in stacks and where it is not allowed to put an item on top of another item leaving earlier. An arriving item is assigned to a stack based only on information on the arrival and departure times for the new item and items currently stored. The objective is to minimize the maximum number of stacks used over time. This problem is referred to as online stacking. We use Reinforcement Learning (RL) techniques to improve heuristics earlier presented in the… Show more
“…Simple heuristics for online stacking similar to Algorithm Online have been presented by Borgman et al [4], Duinkerken et al [8], Hamdi et al [9], and Wang et al [18] without providing a proof of asymptotic optimality. Olsen shows in [12] how Reinforcement Learning can be used to improve simple online stacking heuristics. Finally, we mention the work of Rei and Pedroso [16] and König et al [11] on related problems within the steel industry as well as the PhD thesis by Pacino [14] on container ship stowage.…”
“…Simple heuristics for online stacking similar to Algorithm Online have been presented by Borgman et al [4], Duinkerken et al [8], Hamdi et al [9], and Wang et al [18] without providing a proof of asymptotic optimality. Olsen shows in [12] how Reinforcement Learning can be used to improve simple online stacking heuristics. Finally, we mention the work of Rei and Pedroso [16] and König et al [11] on related problems within the steel industry as well as the PhD thesis by Pacino [14] on container ship stowage.…”
“…Olsen and Gross [14] have developed a polynomial time algorithm for online coloring with a competitive ratio that converges to 1 in probability if the endpoints of the storage time intervals are picked independently and uniformly at random. Olsen [13] has also shown how to use Reinforcement Learning to improve online stacking heuristics.…”
This is the accepted manuscript (post-print version) of the article.Contentwise, the accepted manuscript version is identical to the final published version, but there may be differences in typography and layout.
Stacking is an important process within logistics. Some notable examples of items to be stacked are steel bars or steel plates in a steel yard or containers in a container terminal or on a ship. We say that two items are conflicting if their storage time intervals overlap in which case one of the items needs to be rehandled if the items are stored at the same LIFO storage location. We consider the problem of stacking items using k LIFO locations with a minimum number of conflicts between items sharing a location. We present an extremely simple online stacking algorithm that is oblivious to the storage time intervals and storage locations of all other items when it picks a storage location for an item. The risk of assigning the same storage location to two conflicting items is proved to be of the order 1/k 2 under mild assumptions on the distribution of the storage time intervals for the items. Intuitively, it seems natural to pick a storage location uniformly at random in the oblivious setting implying a risk of 1/k so the risk for our algorithm is surprisingly low. Our results can also be expressed within the context of the MAX k-CUT problem for circle graphs. The results indicate that circle graphs on average have relatively big k-cuts compared to the total number of edges.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.