BACKGROUND: Differentiating gliomas and primary CNS lymphoma represents a diagnostic challenge with important therapeutic ramifications. Biopsy is the preferred method of diagnosis, while MR imaging in conjunction with machine learning has shown promising results in differentiating these tumors. PURPOSE: Our aim was to evaluate the quality of reporting and risk of bias, assess data bases with which the machine learning classification algorithms were developed, the algorithms themselves, and their performance. DATA SOURCES: Ovid EMBASE, Ovid MEDLINE, Cochrane Central Register of Controlled Trials, and the Web of Science Core Collection were searched according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines.STUDY SELECTION: From 11,727 studies, 23 peer-reviewed studies used machine learning to differentiate primary CNS lymphoma from gliomas in 2276 patients.DATA ANALYSIS: Characteristics of data sets and machine learning algorithms were extracted. A meta-analysis on a subset of studies was performed. Reporting quality and risk of bias were assessed using the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) and Prediction Model Study Risk Of Bias Assessment Tool.
DATA SYNTHESIS:The highest area under the receiver operating characteristic curve (0.961) and accuracy (91.2%) in external validation were achieved by logistic regression and support vector machines models using conventional radiomic features. Meta-analysis of machine learning classifiers using these features yielded a mean area under the receiver operating characteristic curve of 0.944 (95% CI, 0.898-0.99). The median TRIPOD score was 51.7%. The risk of bias was high for 16 studies.LIMITATIONS: Exclusion of abstracts decreased the sensitivity in evaluating all published studies. Meta-analysis had high heterogeneity.CONCLUSIONS: Machine learning-based methods of differentiating primary CNS lymphoma from gliomas have shown great potential, but most studies lack large, balanced data sets and external validation. Assessment of the studies identified multiple deficiencies in reporting quality and risk of bias. These factors reduce the generalizability and reproducibility of the findings.ABBREVIATIONS: AI ¼ artificial intelligence; AUC ¼ area under the receiver operating characteristic curve; CNN ¼ convolutional neural network; ML ¼ machine learning; PCNSL ¼ primary CNS lymphoma; PRISMA ¼ Preferred Reporting Items for Systematic Reviews and Meta-Analyses; PROBAST ¼ Prediction model study Risk Of Bias Assessment Tool; TRIPOD ¼ Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis G liomas are the most common primary malignancy of the CNS. 1 An important differential diagnosis for gliomas is primary CNS lymphoma (PCNSL), a more uncommon but highly malignant neoplasia. 2 Correct differentiation of these tumor entities is an important challenge for clinicians because