“…For predictive power, the mean AUC of the final models in training group was 0.876±0.09, ranging from 0.741 to 0.989. Internal validation was performed in 14 studies ( 25 - 27 , 29 , 32 - 38 , 40 , 41 , 43 ), and 1 study employed external validation ( 42 ), but only 12 studies ( 25 , 26 , 29 , 32 - 34 , 36 - 38 , 40 , 41 , 43 ) reported the AUCs of the validation groups, ranging from 0.73 to 0.986. Model calibration was investigated in 7 studies (36.8%) and demonstrated good calibration performance ( 25 , 30 , 32 , 37 , 40 , 42 , 43 ).…”