Objectives. About 80% of all salivary gland tumors involve the parotid glands, and approximately 20% of parotid gland tumors (PGTs) are malignant. Discriminating benign from malignant parotid gland lesions preoperatively is vital for selecting the appropriate treatment strategy. This study evaluated the diagnostic performance of a deep learning system for discriminating benign from malignant PGTs on ultrasonography images and compared it with that of radiologists. Methods. A total of 251 consecutive patients with surgically resected and proven benign or malignant PGTs who underwent preoperative ultrasound examination between January 2014 and November 2020 were enrolled. We compared the diagnostic accuracy of four deep learning models (ViT-B/16, EfficientNetB3, DenseNet121, and ResNet50) with that of radiologists in classifying PGTs. The area under the curve (AUC), sensitivity, specificity, positive predictive value, and negative predictive value were also calculated. Results. Of the 251 patients, 176 were assigned to the training set and 75 to the validation set. Seventy-four of the 251 patients had malignant tumors. The deep learning models performed well in differentiating benign from malignant tumors: the diagnostic accuracy and AUC were 81% and 0.81 for ViT-B/16, 80% and 0.82 for EfficientNetB3, 77% and 0.81 for DenseNet121, and 79% and 0.80 for ResNet50. By comparison, the diagnostic accuracy and AUC of the radiologists were 77%-81% and 0.68-0.75, respectively. The diagnostic accuracy of the deep learning models was higher than that of inexperienced radiologists, but there was no significant difference between the deep learning models and experienced radiologists. Conclusions. This study shows that a deep learning system can be used for diagnosing parotid tumors.
The findings also suggest that a deep learning system may improve the diagnostic performance of inexperienced radiologists.
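
The metrics named in the Methods (accuracy, sensitivity, specificity, positive and negative predictive value, and AUC) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `diagnostic_metrics` and `auc_score` are hypothetical, malignant is assumed to be coded as 1 and benign as 0, and the labels and scores in the example are made-up toy values, not data from this study.

```python
from math import nan
from typing import Dict, Sequence


def diagnostic_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Confusion-matrix metrics for a binary task (assumed: 1 = malignant, 0 = benign)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    def ratio(num: int, den: int) -> float:
        # Return NaN instead of raising when a denominator is empty.
        return num / den if den else nan

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": ratio(tp, tp + fn),  # malignant cases correctly flagged
        "specificity": ratio(tn, tn + fp),  # benign cases correctly cleared
        "ppv": ratio(tp, tp + fp),          # positive predictive value
        "npv": ratio(tn, tn + fn),          # negative predictive value
    }


def auc_score(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a malignant case outscores a benign one.

    Uses the pairwise (Mann-Whitney) formulation; ties count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy values for illustration only (not study data).
    labels = [1, 1, 0, 0, 1, 0]
    predicted = [1, 0, 0, 1, 1, 0]
    malignancy_scores = [0.9, 0.4, 0.3, 0.6, 0.8, 0.1]
    print(diagnostic_metrics(labels, predicted))
    print(auc_score(labels, malignancy_scores))
```

The pairwise AUC is quadratic in the number of cases, which is acceptable for a validation set of this size; a library routine would be the more usual choice for larger data sets.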