“…We constructed these “Laplacian-corrected Naïve Bayesian classifier” models using our previously published protocols. [4,53,55,56,60,73,76,77] Models were internally validated using 5-fold cross-validation in Pipeline Pilot, which involves leaving out a random selection of 20% of the compounds, building a model with the remaining 80% of the dataset, and evaluating the model with the set of 20% that was initially left out. This process was repeated five times, and the following “internal” statistics for the best model were calculated by Pipeline Pilot[50]: ROC rating (Receiver Operator Characteristic curve’s quality), ROC score (the area under the curve of the ROC plot), sensitivity, specificity, and concordance.…”