“…To address this challenge, the field of ML has developed advanced strategies [ 39 , 40 ]. Class balancing techniques, such as random sampling [ 41 ], synthetic minority oversampling technique (SMOTE) [ 42 , 43 ], and other methodologies [ 44 ], have shown promise in reducing data disparities through improved ML model training [ 45 ]. For instance, these approaches have already been used in a variety of research contexts, including medical diagnosis, gait and image analysis [ 46 , 47 , 48 , 49 , 50 ], and have shown significant improvements in disease detection and clinical outcome prediction [ 51 , 52 ].…”