Background
The number of children with obesity has increased in Saudi Arabia, which is a significant public health concern. Early diagnosis of childhood obesity and screening of the prevalence is needed using a simple in situ method. This study aims to generate statistical equations to predict body fat percentage (BF%) for Saudi children by employing machine learning technology and to establish gender and age-specific body fat reference range.
Methods
Data was combined from two cross-sectional studies conducted in Saudi Arabia for 1,292 boys and girls aged 8–12 years. Body fat was measured in both studies using bio-electrical impedance analysis devices. Height and weight were measured and body mass index was calculated and classified according to CDC 2,000 charts. A total of 603 girls and 374 boys were randomly selected for the learning phase, and 153 girls and 93 boys were employed in the validation set. Analyses of different machine learning methods showed that an accurate, sensitive model could be created. Two regression models were trained and fitted with the construction samples and validated. Gradient boosting algorithm was employed to achieve a better estimation and produce the equations, then the root means squared error (RMSE) equation was performed to decrease the error. Body fat reference ranges were derived for children aged 8–12 years.
Results
For the gradient boosting models, the predicted fat percentage values were more aligned with the true value than those in regression models. Gradient boosting achieved better performance than the regression equation as it combined multiple simple models into a single composite model to take advantage of that weak classifier. The developed predictive model archived RMSE of 3.12 for girls and 2.48 boys. BF% and Fat mass index charts were presented in which cut-offs for 5th, 75th and 95th centiles are used to define ‘under-fat’, ‘normal’, ‘overfat’ and ‘subject with obesity’.
Conclusion
Machine learning models could represent a significant advancement for investigators studying adiposity-related issues in children. These models and newly developed centile charts could be useful tools for the estimation and classification of BF%.