This study aims to enhance the interest of university classrooms and the learning efficiency of students. From deep learning, gesture recognition algorithms (GRA) and long short-term memory (LSTM) are adopted to build prediction modules. After the system process is completed, a brand-new human-computer interaction (HCI) system is established. The deep learning GRA augments the operation of the recognition algorithm of the pattern matching method created on the previous image segmentation. Its enticements on two merits of gesture estimation are as follows: rapid detection of gesture joint features and convolutional neural network (CNN) image classification, thereby improving GRA. The recognition rates of “take”, “pinch”, and “point” are 98.6%, 99.5%, and 99.4%, respectively. When the data volume of the LSTM network model is less than 10, the prediction accuracy can reach 70%, and the performance is relatively stable. The designed HCI system can better recognize the teacher’s gesture intent in teaching, execute gesture commands, and optimize the interactive teaching method between the teacher and the computer. This system is a derivative form of gamification teaching. While it improves the interest in teaching, it is also of great significance for improving the teaching efficiency of general higher education.