Prediction of Anemia Using Machine Learning Algorithms: Scoping Review
Abstract
Introduction: One of the major public health problems is anemia, especially affecting newborn and infant children, adolescent girls, young women, pregnant women, and postpartum women. The cause of anemia is the reduced supply of red blood cells in the human body or the damage or weakening of the structure of red blood cells. One of the preferences of utilizing machine learning is the prediction of results.
Objective: The purpose of this study is to compare effective algorithms, related to the origin or source of the data set, data set size, metric evaluation and accuracy and produce predictors in predicting anemia using machine learning.
Method: This research uses a scoping review method on 4 databases, namely Scopus, EBSCO, PubMed, and IEEE Xplore from 2019 - 2024 with keywords anemia, algorithms, machine learning, and prediction. The results of screening articles on the Scopus, EBSCO, PubMed, and IEEE Xplore databases obtained 384 articles which were then selected through several stages and obtained 9 articles.
Result: The review found that the highest algorithm performance in anemia prediction, namely Penalized Regression (LASSO regression) accuracy above 64%, XGboost accuracy 100% and execution time 0.2404 seconds, Catboost accuracy 97.6%, Random Forest accuracy 95.49% and 72%, J48 algorithm accuracy of 97.7%, Logistic Regression accuracy 66% and AUC 69%, and SVM linear AUC 79.9%.
Conclusion: Machine learning can assist in the development of anemia prediction models by exploring large amounts of data and producing precise and fast predictors. The predictors obtained are determined by the selection of algorithms in the study.
References
World Health Organization. Anaemia [Internet]. 2023 [cited 2024 Sep 16]. Available from: https://www.who.int/news-room/fact-sheets/detail/anaemia
Kementerian Kesehatan RI. Survey Kesehatan Indonesia 2023 Dalam Angka [Internet]. 2023 [cited 2024 Sep 16]. Available from: https://www.badankebijakan.kemkes.go.id/ski-2023-dalam-angka/
Zemariam AB, Yimer A, Abebe GK, Wondie WT, Abate BB, Alamaw AW, et al. Employing supervised machine learning algorithms for classification and prediction of anemia among youth girls in Ethiopia. Sci Rep. 2024 Apr 20;14(1):9080. https://doi.org/10.1038/s41598-024-60027-4
Tartan EO, Berkol A, Ekici Y. Anemia Diagnosis By Using Artificial Neural Networks. Int J Multidiscip Stud Innov Technol. 2020;4(1):14–7.
Ortiz-Prado E, Dunn JF, Vasconez J, Castillo D, Viscor G. Review Article: Partial pressure of oxygen in the human body: a general review. Am J Blood Res. 2019;9(1):1–14.
Sidey-Gibbons JAM, Sidey-Gibbons CJ. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019 Dec;19(1):64. https://doi.org/10.1186/s12874-019-0681-4
Khan JR, Chowdhury S, Islam H, Raheem E. Machine Learning Algorithms To Predict The Childhood Anemia In Bangladesh. J Data Sci. 2019 Jan 17;17(1):195–218. https://doi.org/10.6339/JDS.201901_17(1).0009
Pullakhandam S, McRoy S. Classification and Explanation of Iron Deficiency Anemia from Complete Blood Count Data Using Machine Learning. BioMedInformatics. 2024 Mar 1;4(1):661–72. https://doi.org/ 10.3390/biomedinformatics4010036
Kassaw AK, Yimer A, Abey W, Molla TL, Zemariam AB. The application of machine learning approaches to determine the predictors of anemia among under five children in Ethiopia. Sci Rep. 2023 Dec 21;13(1):22919. https://doi.org/10.1038/s41598-023-50128-x
Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inform Decis Mak. 2019 Dec;19(1):281. https://doi.org/10.1186/s12911-019-1004-8
Meitei AJ, Saini A, Mohapatra BB, Singh KhJ. Predicting child anaemia in the North-Eastern states of India: a machine learning approach. Int J Syst Assur Eng Manag. 2022 Dec;13(6):2949–62. https://doi.org/10.1007/s13198-022-01765-4
Shweta N, Pande SD. Prediction of Anemia using various Ensemble Learning and Boosting Techniques. EAI Endorsed Trans Pervasive Health Technol 2023 Oct 20;9. Available from: https://publications.eai.eu/index.php/phat/article/view/4197. https://doi.org/10.4108/eetpht.9.4197
Dejene BE, Abuhay TM, Bogale DS. Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm. BMC Med Inform Decis Mak. 2022 Sep 22;22(1):247. https://doi.org/10.1186/s12911-022-01992-6
Maasthi MJ, Gururaj HL, Ravi V, D B, Almeshari M, Alzamil Y. Decision-making Support System for Predicting and Eliminating Malnutrition and Anemia. Open Bioinforma J. 2023 Oct 27;16(1):e18750362246898. https://doi.org/10.2174/0118750362246898230921054021
Göl M, Aktürk C, Talan T, Vural MS, Türkbeyler ?H. Predicting malnutrition?based anemia in geriatric patients using machine learning methods. J Eval Clin Pract. 2024 Sep 23;jep.14142. https://doi.org/10.1111/jep.14142
Tesfaye SH, Seboka BT, Sisay D. Application of machine learning methods for predicting childhood anaemia: Analysis of Ethiopian Demographic Health Survey of 2016. Moinuddin M, editor. PLOS ONE. 2024 Apr 11;19(4):e0300172. https://doi.org/10.1371/journal.pone.0300172
Pan Y, Du R, Han X, Zhu W, Peng D, Tu Y, et al. Machine Learning Prediction of Iron Deficiency Anemia in Chinese Premenopausal Women 12 Months after Sleeve Gastrectomy. Nutrients. 2023 Jul 30;15(15):3385. https://doi.org/10.3390/nu15153385
World Health Organization. Anaemia in women and children [Internet]. 2021 [cited 2024 Oct 9]. Available from: https://www.who.int/data/gho/data/themes/topics/anaemia_in_women_and_children
Singh S, Parihar S. Prevalence of anemia in under five-year-old children: a hospital-based study. Int J Contemp Pediatr. 2019 Feb 23;6(2):842. https://doi.org/10.18203/2349-3291.ijcp20190740
Anand P, Gupta R, Sharma A. Prediction of Anaemia among children using Machine Learning Algorithms. Int J Electron Eng. 2019 Jun;11(2):469–80.
Kumar K, Kumar P, Deb D, Unguresan ML, Muresan V. Artificial Intelligence and Machine Learning Based Intervention in Medical Infrastructure: A Review and Future Trends. Healthcare. 2023 Jan 10;11(2):207. https://doi.org/10.3390/healthcare11020207
Gaurav K, Kumar A, Singh P, Kumari A, Kasar M, Suryawanshi T. Human Disease Prediction using Machine Learning Techniques and Real-life Parameters. Int J Eng. 2023;36(6):1092–8. https://doi.org/10.5829/IJE.2023.36.06C.07
Dukhi N, Sewpaul R, Derrick Sekgala M, Olawale Awe O. Artificial Intelligence Approach for Analyzing Anaemia Prevalence in Children and Adolescents in BRICS Countries: A Review. Curr Res Nutr Food Sci J. 2021 Apr 30;9(1):01–10. https://doi.org/10.12944/CRNFSJ.9.1.01
Wallner C, Hurst J, Behr B, Rony MAT, Barabás A, Smith G. Fanconi Anemia: Examining Guidelines for Testing All Patients with Hand Anomalies Using a Machine Learning Approach. Children. 2022 Jan 7;9(1):85. https://doi.org/10.3390/children9010085
Zahirzada A, Zaheer N, Shahpoor MA. Machine Learning Algorithms to Predict Anemia in Children Under the Age of Five Years in Afghanistan: A Case of Kunduz Province. J Surv Fish Sci. 2023;10(4S):752–62.
Chen RC, Dewi C, Huang SW, Caraka RE. Selecting critical features for data classification based on machine learning methods. J Big Data. 2020 Dec;7(1):52. https://doi.org/10.1186/s40537-020-00327-4
Luque A, Carrasco A, Martín A, De Las Heras A. The impact of class imbalance in classification performance metrics based on the binary confusion matrix. Pattern Recognit. 2019 Jul;91:216–31. https://doi.org/10.1016/j.patcog.2019.02.023
Greenwood CJ, Youssef GJ, Letcher P, Macdonald JA, Hagg LJ, Sanson A, et al. A comparison of penalised regression methods for informing the selection of predictive markers. Kotozaki Y, editor. PLOS ONE. 2020 Nov 20;15(11):e0242730. https://doi.org/10.1371/journal.pone.0242730
Liu J, Ma Y, Xie W, Li X, Wang Y, Xu Z, et al. Lasso-Based Machine Learning Algorithm for Predicting Postoperative Lung Complications in Elderly: A Single-Center Retrospective Study from China. Clin Interv Aging. 2023 Apr;Volume 18:597–606. https://doi.org/10.2147/CIA.S406735
Yeruva S, Varalakshmi MS, Gowtham BP, Chandana YH, Prasad PesnK. Identification of Sickle Cell Anemia Using Deep Neural Networks. Emerg Sci J. 2021 Apr 1;5(2):200–10. https://doi.org/10.28991/esj-2021-01270
Christian Y. Comparison of Machine Learning Algorithms Using WEKA and Sci-Kit Learn in Classifying Online Shopper Intention. J Inform Telecommun Eng. 2019 Jul 25;3(1):58–66. https://doi.org/10.31289/jite.v3i1.2599
Copyright (c) 2024 Media Publikasi Promosi Kesehatan Indonesia (MPPKI)
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with Journal of Public Health and Pharmacy retain the copyright of their work. The journal applies a Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0), which grants the following rights:
-
Copyright Retention: Authors retain the copyright of their work, maintaining full control over their intellectual property without restrictions.
-
Right of First Publication: Authors grant the journal the right of first publication of their work. This ensures that the work is initially published and credited in Journal of Public Health and Pharmacy.
-
License to Share and Reuse: The work is licensed under CC BY-SA 4.0, allowing others to copy, distribute, remix, and build upon the work for any purpose, even commercially, as long as proper credit is given to the authors, and any new creations are licensed under the same terms.