“…Random forest was chosen as it is relatively easy to inspect feature importance, is fairly robust to data imbalances, and has previously been used to classify behaviour in sheep with a high degree of success (e.g. Kleanthous et al, 2018Kleanthous et al, , 2019Kleanthous et al, , 2020Lush et al, 2018;Mansbridge et al, 2018;Walton et al, 2018;Kaler et al, 2020). When training the classifier, we specified an mtry (the number of variables tried at each split) equal to the square root of the number of variables used for classification (approximately 1.7) and ntree (the number of trees grown) of 500 (Barwick et al, 2018b;Fogarty et al, 2020a).…”