In this paper, we propose a computable method to evaluate the aesthetic value of fusion images of folk martial arts and dance based on human visual and aesthetic habits, extract features and use them as evaluation indexes from three aspects: technical features, perceptual features, and social features, establish an aesthetic evaluation model by fusing each index, and determine the influence factors of each index by using online research. The aesthetic evaluation of fusion images of folk martial arts and dance is carried out automatically. The results of the experimental tests on the evaluation dataset of the aesthetic quality of fusion images of folk martial arts and dance at the Communication University of China show that the accuracy of the proposed deep learning algorithm-based method for analyzing the aesthetic index of fusion of folk martial arts and dance can reach 98.08% with certain validity, and the evaluation results of each index are clear and intuitive, which can play a guiding role in improving the aesthetic quality of fusion of folk martial arts and dance from various angles. It can be used as a guide to improving the aesthetics of the fusion of folk martial arts and dance from various perspectives. By integrating the model into the mobile application, it is possible to evaluate and score the aesthetics of multiple portrait photos uploaded by users and select photos to be kept or deleted based on the scoring results, thus simplifying user operations.