With the rapid development of artificial intelligence, more and more researchers and research institutions begin to pay attention to the bone feature recognition field. Human bone movement is very complex, and human bone shape recognition technology can be widely used in medical treatment, sports, and other fields. At present, there are mainly two kinds of methods for extracting the shape features of human foot bone based on optical image acquisition technology and sensor information perception technology. However, due to the interference factors such as target posture change, camera shake, and individual behavior differences, it is still a very challenging task to design a robust algorithm for extraction and analysis of foot bone shape features. In recent years, convolutional neural network- (CNN-) based foot contour feature recognition methods emerge one after another and have made breakthrough progress. How to use and how to fully explore the potential relationship of various characteristics contained in the foot bone data and how to enhance the robustness of view changes and other aspects need to be further studied. In this context, this paper proposed an improved CNN model, which not only has the capability of deep feature extraction of the CNN model but also can obtain the optimal model parameters with the combination of particle swarm optimization algorithm. The effectiveness of the proposed method in the extraction and analysis of foot bone shape features is verified in the simulation experiment.