Objective Sensitivity and specificity of SLE classification criteria may vary in different populations and clinical settings. In this study, we aimed to compare the performances of three criteria sets/rules (1997, 2012, and 2019) in a large cohort of patients and relevant diseased controls. Methods The medical records of consecutive SLE patients and diseased controls were reviewed for clinical and laboratory features relevant to all sets of criteria. Criteria sets/rules were analyzed based on sensitivity, positive predictive value, specificity, and negative predictive value, using clinical diagnosis with at least 6 months of follow-up as the gold standard. A subgroup analysis was performed in ANA positive patients. Results A total of 393 SLE patients and 308 non-SLE diseased controls were included. Sensitivity was 78.4% for 1997 criteria and was more than 90% for both 2012 (91.9%) and 2019 (94.4%) criteria. Specificity was the highest (95.1%) for 1997 ACR criteria, 91.5% for 2012 SLICC criteria and 91.2% for 2019 EULAR/ACR criteria. When only ANA positive patients were analyzed, sensitivity of each criteria increased by 1%, 0.8%, and 2.2%, respectively. Specificity of 1997 criteria decreased by 2% and specificity of 2012 and 2019 criteria both decreased to less than 90%. Conclusion EULAR/ACR criteria were more sensitive than 1997 criteria and had a comparable performance with SLICC criteria. When only ANA positive patients were analyzed, the presence of false positive results (originated from patients with Sjögren’s disease and antiphospholipid syndrome mainly) decreased the specificity of both SLICC and EULAR/ACR criteria.