In this paper, we propose a multiresidual module convolutional neural network-based method for athlete pose estimation in sports game videos. The network firstly designs an improved residual module based on the traditional residual module. Firstly, a large perceptual field residual module is designed to learn the correlation between the athlete components in the sports game video within a large perceptual field. A multiscale residual module is designed in the paper to better solve the inaccuracy of the pose estimation due to the problem of scale change of the athlete components in the sports game video. Secondly, these three residual modules are used as the building blocks of the convolutional neural network. When the resolution is high, the large perceptual field residual module and the multiscale residual module are used to capture information in a larger range as well as at each scale, and when the resolution is low, only the improved residual module is used. Finally, four multiresidual module convolutional neural networks are used to form the final multiresidual module stacked convolutional neural network. The neural network model proposed in this paper achieves high accuracy of 89.5% and 88.2% on the upper arm and lower arm, respectively, so the method in this paper reduces the influence of occlusion on the athlete’s posture estimation to a certain extent. Through the experiments, it can be seen that the proposed multiresidual module stacked convolutional neural network-based method for athlete pose estimation in sports game videos further improves the accuracy of athlete pose estimation in sports game videos.