“…Unlike statistical models, machine learning models tend not to assume anything about the underlying distribution of each feature ( 4 , 5 ). Furthermore, some machine learning models, such as random forest (RF) and related classifiers, are capable of identifying dependencies between features without the need for the user to explicitly include these dependencies in the model ( 11 , 14 – 17 ). One ability, arguably underused, inherent to this class of models is that they can be used in an “unsupervised” manner to learn a dissimilarity function ( 15 , 18 , 19 ).…”