To accurately assess students’ cognitive state of knowledge points in the learning process within the smart classroom, a knowledge tracing (KT) model based on classroom network characteristic learning engagement and temporal-spatial feature fusion (CL-TSKT) is proposed. First, a classroom network is constructed based on the information of the student ID, seating relationship, student–student interaction, head-up or head-down state, and classroom network characteristics obtained from a smart classroom video. Second, a learning engagement model is established by utilizing the student–student interactions, head-up or head-down state, and classroom network characteristics. Finally, according to the learning engagement model and the knowledge point test data, a parallel temporal attention GRU network is proposed. It is utilized to extract the temporal features of the knowledge points and learning engagement. They are fused to obtain the knowledge point-learning engagement temporal characteristics and their associated attributes. Meanwhile, a CNN is used to extract the knowledge point-knowledge point spatial features. We consider the associative properties of knowledge point-knowledge points from a spatial perspective and fuse the knowledge point-knowledge point spatial features with the knowledge point-learning engagement temporal features. To accurately characterize the cognitive state of the knowledge points and provide effective support for teachers’ accurate and sustainable interventions for learners in the teaching and learning process, this paper conducts extensive experiments on four real datasets. The CL-TSKT model in this paper shows superior performance in all four evaluation metrics, compared with the state-of-the-art KT models.