Background: There has been a considerable increase in total joint arthroplasty (TJA) research using machine learning (ML). Therefore, the purposes of this study were to synthesize the applications and efficacies of ML reported in the TJA literature, and to assess the methodological quality of these studies.Methods: PubMed, OVID/MEDLINE, and Cochrane libraries were queried in January 2021 for articles regarding the use of ML in TJA. Study demographics, topic, primary and secondary outcomes, ML model development and testing, and model presentation and validation were recorded. The TRIPOD (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis) guidelines were used to assess the methodological quality.Results: Fifty-five studies were identified: 31 investigated clinical outcomes and resource utilization; 11, activity and motion surveillance; 10, imaging detection; and 3, natural language processing. For studies reporting the area under the receiver operating characteristic curve (AUC), the median AUC (and range) was 0.80 (0.60 to 0.97) among 26 clinical outcome studies, 0.99 (0.83 to 1.00) among 6 imaging-based studies, and 0.88 (0.76 to 0.98) among 3 activity and motion surveillance studies. Twelve studies compared ML to logistic regression, with 9 (75%) reporting that ML was superior. The average number of TRIPOD guidelines met was 11.5 (range: 5 to 18), with 38 (69%) meeting greater than half of the criteria. Presentation and explanation of the full model for individual predictions and assessments of model calibration were poorly reported (<30%).
Conclusions:The performance of ML models was good to excellent when applied to a wide variety of clinically relevant outcomes in TJA. However, reporting of certain key methodological and model presentation criteria was inadequate. Despite the recent surge in TJA literature utilizing ML, the lack of consistent adherence to reporting guidelines needs to be addressed to bridge the gap between model development and clinical implementation.T he application of artificial intelligence (AI) within orthopaedic surgery research has undergone a substantial increase in recent years [1][2][3][4][5][6][7][8][9][10][11][12] . Machine learning (ML) represents a subset within AI that includes algorithms that autonomously learn from data and make predictions without explicit programmed instructions. The adoption of ML in orthopaedic research has been a function of increasing data availability and several purported benefits including (1) optimization of model performance through "learning," (2) avoidance of limitations inherent to traditional regression analyses, and (3) ability to integrate individualized patient elements to provide individu-alized explanations of anticipated outcomes 6,[13][14][15] . Despite the proliferation of ML literature and the theoretical benefits of ML, it is important to assess whether these analyses have resulted in meaningful results that may influence clinical practice.Current investigations using ML within the total j...