Decision-making plays an essential role in various real-world systems like automatic driving, traffic dispatching, information system management, and emergency command and control. Recent breakthroughs in computer game scenarios using deep reinforcement learning for intelligent decision-making have paved decision-making intelligence as a burgeoning research direction. In complex practical systems, however, factors like coupled distracting features, long-term interact links, and adversarial environments and opponents, make decision-making in practical applications challenging in modeling, computing, and explaining. This work proposes game interactive learning, a novel paradigm as a new approach towards intelligent decision-making in complex and adversarial environments. This novel paradigm highlights the function and role of a human in the process of intelligent decision-making in complex systems. It formalizes a new learning paradigm for exchanging information and knowledge between humans and the machine system. The proposed paradigm first inherits methods in game theory to model the agents and their preferences in the complex decision-making process. It then optimizes the learning objectives from equilibrium analysis using reformed machine learning algorithms to compute and pursue promising decision results for practice. Human interactions are involved when the learning process needs guidance from additional knowledge and instructions, or the human wants to understand the learning machine better. We perform preliminary experimental verification of the proposed paradigm on two challenging decision-making tasks in tactical-level War-game scenarios. Experimental results demonstrate the effectiveness of the proposed learning paradigm.
KEYWORDSdecision-making; game interactive learning; human-computer interaction; game theory; machine learning R eal-world systems like automatic driving [1] , traffic dispatching [2] , information system management [3] , and emergency command and control [4] , inevitably involve the decision-making procedures within all their running process. The quality of the decision-making results, either accomplished by the human or the machine, thus significantly affects these systems' performance. With the fast development of Artificial Intelligence (AI) in the last decades [5,6] , especially the deep learning models and algorithms, we have witnessed significant progress in highperformance intelligent perception models of audio, visual, and text data [7][8][9] . Due to the newly arising deep reinforcement learning algorithms, researchers have also made a substantial advance in developing human-level intelligent decision models in challenging computer games like Go [10] , Pokers [11,12] , and real-time strategy games [13][14][15] . The MuZero model [16] achieves expert-level performance in Go, Chess, Shogi, and Atrai 2600 games simultaneously. These advances have paved decision-making intelligence as a growing research direction for further artificial intelligence developments in more complex sys...