“…SL jointly employs pre-labeled data, e.g., MCI versus healthy subjects, and additional features derived from clinical or neuroimaging sources to determine which feature predicts the pre-labeled data the most (Dwyer et al, 2018;Graham et al, 2020). SL operates with probabilistic and non-probabilistic classifiers (Naïve Bayes and Support Vector Machine, respectively), as well as with decision tree, linear, and logistic regression (Dhall and Kaur, 2020). UL techniques, instead, sets unlabeled and unstructured data, e.g., clinical notes, as a starting point to seek relationships or patterns and to learn general representations that enable the automatic extraction of information when building predictors (Miotto et al, 2017;Dwyer et al, 2018;Graham et al, 2020).…”