Currently, production management of cutting-edge IT products takes place in a challenging environment of technological complexity and rapid market changes. In this environment, production control is increasingly important and has become a key factor in determining product quality, cost, and delivery time. High-tech IT product production lines are comprised of a complex of various processes and technologies, so problems that may arise at each stage can have a significant impact on the final product. Therefore, an efficient production management system plays an important role in minimizing risks that may occur during the manufacturing process, maintaining product quality, and reducing production costs. In this study, we review existing studies focusing on reinforcement learning theory among machine learning theories in the field of production management, especially scheduling, and suggest limitations of those studies and future research directions.