“…Overall, it was not the choice of ML algorithm, but primarily data quality, that impacted the performance the most, which is specifically challenging in fault diagnosis, as there is a severe lack of labeled data. GBR, TPOT Sensor failure Guelpa et al [87] Analytical Fouling Cadei et al [88] ARIMA, RIDGE, one-class SVM Fouling Kim et al [89] kM, MLP Fouling Park et al [90] RF Valves Langroudi et al [91] LR, DTR, RIDGE, kNN, PLS, SVM, RF, LASSO, XGBoost, ANN Pipes Bahlawan et al [92] Analytical Pipes Manservigi et al [93] Analytical Pipes Bode et al [94] LR, kNN, CART, RF, NB, SVM, ANN Multi-label Choi et al [96] AE, MLP Multi-label Li et al [95] kNN, RF, ANN, CNN Multi-label…”