Optimizing machine learning models for predicting anemia among under-five children in Ethiopia: insights from Ethiopian demographic and health survey data
Yimer A, Yesuf HA, Ahmed S, Zemariam AB, Mussa E, Sirage N, Yesuf A, Kassaw AK
BACKGROUND: Healthcare practitioners require a robust predictive system to accurately diagnose diseases, especially in young children with conditions such as anemia. Delays in diagnosis and treatment can have severe consequences, potentially leading to serious complications and childhood mortality. By leveraging machine learning methods with extensive datasets, valuable and scientifically sound insights can be generated to address pressing health and healthcare-related challenges.
OBJECTIVES: The primary objective of this study was to identify the most effective machine-learning algorithm for predicting anemia among under five children in Ethiopia.
METHODS: The data utilized in this study were sourced from the 2016 Ethiopian Demographic and Health Survey. Six machine-learning models, comprising a classic logistic regression model along with random forest, decision tree, support vector machine, Naïve Bayes, and K-nearest neighbors, were employed to predict factors influencing anemia in children under five. The predictive capacities of each machine-learning model were evaluated using receiver operating characteristic curves and various measures of model accuracy.
RESULTS: The random forest model demonstrated the highest accuracy among the algorithms tested, achieving an overall accuracy of 81.16%. The accuracy rates for the decision tree, support vector machines, Naïve Bayes, K-nearest neighbors, and classical logistic regression models were 68.40%, 59.94%, 53.06%, 69.96%, and 54.79%, respectively.
CONCLUSION: In general, the random forest algorithm emerged as the preferred model for predicting anemia in children under five. The model exhibited a specificity of 79.26%, sensitivity of 83.07%, positive predictive value of 80.02%, negative predictive value of 82.40%, and an area under the curve of 81.80%.
© 2025. The Author(s).
BMC pediatrics, 2025-04-24