Background
Anemia during pregnancy is a significant public health concern, particularly in resource-limited settings. Machine learning (ML) offers promising avenues for improved anemia detection and management. This study investigates the potential of ML models in predicting anemia severity among pregnant women attending Antenatal Care (ANC) visits in Ethiopia.
Methods
Data from the Ethiopian Demographic Health Survey, specialized hospitals, and public hospitals were utilized. The dataset included individuals diagnosed with severe (65.12%), moderate (15.63%), mild (16.65%) anemia, and non-anemic (2.61%) cases. Feature selection employed filter methods based on mutual information, and F-score was used to assess anemia severity prediction across four classes. Six ML models (MLP-NN, XGBoost, GNB, Decision Tree, Random Forest, and KNN) were evaluated using accuracy, precision, recall, and F1-score.
Results
The Random Forest classifier achieved the best overall performance across all categories, with an accuracy of 97%, precision of 93%, recall of 93%, and F1-score of 93%. This indicates high true positive rates and low false positive rates. While other models like XGBoost, MLP-NN, and Decision Tree showed good performance, they weren’t quite as strong as Random Forest. Classifiers like KNN and GNB had lower overall accuracy and a tendency to misclassify some cases.
Conclusions
This study demonstrates the promising potential of Random Forest in predicting anemia severity among pregnant women in Ethiopia. The findings contribute to a more holistic understanding of anemia risk factors and pave the way for improved early detection and targeted interventions.