“…Incorporate clinical insights [105] Small sample size Data imputation [32], [59], [114] Interactive user interface [42], [90], [94], [105] Bad data quality Artifact correction [64], [115] Clinical feedback [90], [106] Imbalanced classes Data augmentation [62], [65] Visualization of feature importance [33], [71], [80] Complex disease phenotype Multi-modality data [57], [116] Clustering analysis [67], [87] Data heterogeneity Data normalization [59] Decision tree [4], [32], [33] Lack of expert annotation Weakly supervised learning [51], [71], [87] Use multiple feature importance approaches [13], [41], [74] Unkown sources of signal Key feature extraction [37], [58], [69] Cross validation when comparing models [46], [105], [107], [110] Explanations unclear Pre-processing changes [67], [68] Use appropriate and robust performance metrics (AUROC, MCC) [110], [111] Data leakage Patient-level split [45]…”