“…We have publicly published a MATLAB toolbox for stability selection that can be used to replicate the approach that we have taken here (i.e., elastic net), as well as to implement 14 other classification and regression algorithms that leverage MATLAB’s machine learning toolbox. We additionally see this package as an opportunity to highlight to researchers the dangers of data leakage that have become problematically common in neuroimaging studies (Poulin et al, 2019; Eitel et al, 2021; Kambeitz et al, 2015; Pulini et al, 2019; Whelan & Garavan, 2014; Mateos-Perez et al, 2018; Yagis et al, 2021; Kapoor & Narayanan, 2022; Rosenblatt et al, 2023; 2023; Poldrack et al, 2019), and package our toolbox with a variety of tutorials for implementing appropriate cross-validation with feature selection. We anticipate this package will remain useful even as clinical neuroimaging datasets grow in size as it is well-established that even in datasets with a limited number of features or more equal ratio of features to samples, removing redundant or noisy features can improve model estimation and performance by reducing the amount of noise that is available for the model to overfit (e.g., Bzdok et al, 2018; Hawkins, 2004; Heinze et al, 2018).…”