“…We demonstrate the accuracy, ease of use, and power of SIMON on five different biomedical datasets and build predictive models for arboviral infection severity (SISA), 42 the identification of the cellular immune signature associated with a high-level of physical activity (Cyclists), 43 the determination of the humoral responses that mediate protection against Salmonella Typhi infection (VAST), 44 early stage detection of colorectal cancer from microbiome data (Zeller), 45 , 46 and the detection of liver hepatocellular carcinoma cells (LIHC) 47 ( Figure 1 B–1E; Supplemental Information , Videos S1 and S6 ). To build models using the SISA dataset containing clinical parameters (described in the Experimental Procedures and available as Table S2 ), 11 ML algorithms were used, 5 from the original publication 42 (treebag, k nearest neighbors, random forest, stochastic generalized boosting model, and neural network) and, in addition, “sda,” shrinkage discriminant analysis; “hdda,” high-dimensional discriminant analysis; “svmLinear2,” support vector machine with linear kernel; “pcaNNet,” neural networks with feature extraction; “LogitBoost,” boosted logistic regression, and naive Bayes.…”