With the development of industrial Internet and artificial intelligence, data fusion in crossdomains and cross-layers have become an inevitable trend. Most of the data fusion involved in the production process of hot rolling are concentrated on the level of sensors, Internet of Things (IoT) and the Internet; but human data are not well integrated. In order to avoid the human factor from becoming the bottleneck of the entire production schedule, this paper proposes a ternary data fusion model based on reinforcement learning algorithm. The related data source from human-cyber-physical space includes: social network, Internet and IoT. By merging the ternary data, a variety of data (including humans') can be quickly calculated to obtain better and faster decisions. In order to achieve automated fusion from ternary data, this paper proposes a method based on reinforcement learning: firstly, the domain ontology used for associating ternary data is reduced and tessellated (dimension reduction), and then the reinforcement learning model is used to form "the new ontology". Compared with resource-intensive global calculations (which may cost a few days), the new method can complete the calculations in minutes. This means that the new method optimizes the data source required for decision-making and improves the efficiency. Finally, the production scheduling of hot rolled steel is used as an example to verify the feasibility of the proposed method. INDEX TERMS human-cyber-physical data fusion, ternary data fusion, cyber-physical system, domain ontology, reinforcement learning