A learning-based dynamic routing algorithm is proposed for the overhead hoist transport (OHT) systems of semiconductor fabrication facilities (fabs). An OHT system, which consists of multiple vehicles moving at high speeds on guided rails, is the primary automated material-handling system (AMHS) in a fab. Modern large-scale fabs have hundreds of vehicles moving lots between multiple processing machines. The dynamic routing method is a route guidance method that dynamically selects the best vehicle paths under given traffic conditions and congestion levels. Building on the Q(λ) learning method, we develop a reinforcement learning-based dynamic routing algorithm called QLBWR(λ), which consists of a Boltzmann softmax policy and a reward function. The proposed algorithm uses real-time information to effectively guide each vehicle so that it avoids congestion and finds an efficient path. The algorithm is also designed with a low computational burden, such that the efficient route can be found for hundreds of vehicles in real time. Simulation analyses on an actual fab layout are used to compare the performance of the proposed algorithm with common static and dynamic algorithms. The results show that the proposed algorithm outperforms the benchmarking algorithms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.