Background: With the wide application of digital tongue diagnosis instrument, massive tongue images will be produced. Adequate image quality is the prerequisite to ensure accurate tongue image analysis. In the process of tongue image collection, improper operation may lead to many poor-quality images (fogging, underexposure, overexposure, blurred focus, wrong tongue posture, etc.), which seriously affect the image processing and the accuracy of image analysis. However traditional pattern recognition is difficult to evaluate the quality of tongue images by extracting features and manual removal of tongue images with bad quality consumes a lot of labor and has a high error rate. In this research, we utilized a deep convolutional neural network to automatically select bad quality tongue images.Methods: The present study was conducted to identify the most appropriate CNN model for Tongue Image Quality Assessment based on deep CNN. The CNN model was evaluated by using Residual neural network and compared with VGGNet and DenseNet. Evaluation metrics such as accuracy, precision, recall, and F1-score were used for CNN model performance.Results: A detection model is established for tongue image quality control based on deep residual network, with an average accuracy of 99.04%, accuracy of 99.05%, recall of 99.04%, and F1-score of 99.05%, which can be used for quality screening of massive tongue images.Conclusions: Our research findings demonstrate various CNN models in the decision-making process for the selection of tongue image quality assessment and prove that applying deep learning methods, specifically deep CNN, to evaluate bad quality tongue images is feasible.