Mon–Fri 10:00–17:00 IST
IJMEM Logo
International Journal of Modern Engineering and Management | IJMEM
Multidisciplinary
Open Access Journal
ISSN No: 3048-8230
Follows UGC–CARE Guidelines
Home Scope Indexing Publication Charges Archives Editorial Board Downloads Contact Us

Predicting Diabetes in Bangladesh Using Machine Learning: A Data-Driven Approach

Author(s):

Tonny Shekha Kar1, Jothirmoy Sarker Shuvo2

Affiliation: 1,2Department of Computer Science, 3Department of Electrical & Electronic Engineering, 1,2,3American International University Bangladesh, Bangladesh

Page No: 39-47-

Volume issue & Publishing Year: Volume 2 Issue 4,April-2025

Journal: International Journal of Modern Engineering and Management | IJMEM

ISSN NO: 3048-8230

DOI:

Abstract:

Diabetes is a chronic condition caused by inadequate insulin synthesis, which prevents the body from processing blood sugar. Diabetes' etiology is still mostly unknown. Furthermore, diabetes is not routinely measured. Since there is no known treatment for diabetes, it is crucial for those who have disease to monitor their blood sugar levels in order to manage and maintain their health. Therefore, safeguards against diabetes or early detection are necessary. Diabetes symptoms can occur suddenly, and it can be mild so it can take years to notice. Diabetes can damage the heart, eyes, kidneys, nerves and damage blood vessels over time. As of today, no known means to prevent diabetes nor its cause are known so early diagnosis or predicting diabetes is necessary to prevent the worst effects of diabetes. Additionally, People in Bangladesh are still ignorant about diabetes and unaware exactly when diabetes has to be measured. In this paper, we are going to predict diabetes by using machine learning. We compared conventional machine learning with deep learning approaches. For the conventional machine learning method, we considered the most commonly used classifiers: K-Nearest Neighbors (KNN) and Random Forest (RF). On the other hand, for Deep Learning (DL) we employed a fully Artificial Neural Networks (ANN) and Long Short-Term Memory (LSTM) Algorithm to predict and detect diabetes patients.

Keywords:

Artificial Neural Networks, Diabetes Prediction, Deep Learning, Random Forest. 

Reference:

  • [1] Magliano, D. J., Boyko, E. J., & International Diabetes Federation. (2025). IDF Diabetes Atlas 11th Edition – 2025. In Optima (Ed.), IDF Diabetes Atlas (11th ed., pp. 84–85) [Report]. https://diabetesatlas.org.
    [2] GBD 2021 Diabetes Collaborators. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: A systematic analysis for the Global Burden of Disease Study 2021. Lancet, 402(10397), 203–234, 2023. doi:10.1016/S0140-6736(23)01301-6.
    [3] Islam Ayon, S., Islam, M. M., & Islam, M. M. (2019). Diabetes prediction: A deep learning approach. International Journal of Information Engineering and Electronic Business, 11(2), 21–27.
    [4] Khan, F. A., Zeb, K., AL-Rakhami, M., Derhab, A., & Bukhari, S. A. C. (2021). Detection and prediction of diabetes using data mining: A comprehensive review. IEEE Access, 9, 43711–43735.
    [5] American Diabetes Association. (2014). Diagnosis and classification of diabetes mellitus. Diabetes Care, 37(1), 81–90.
    [6] Djerioui, M., Brik, Y., Ladjal, M., & Attallah, B. (2019). Neighborhood component analysis and support vector machines for heart disease prediction. Ingénierie des Systèmes d’Information, 24(6), 591–595.
    [7] World Health Organization (WHO). (2024). Diabetes. Retrieved April 17, 2025, from https://www.who.int/news-room/fact-sheets/detail/diabetes.
    [8] Roglic, G. (2016). WHO Global report on diabetes: A summary. International Journal of Noncommunicable Diseases, 1(1), 3–8.
    [9] Nazirun, N. N. N. et al. (2024). Prediction models for type 2 diabetes progression: A systematic review. IEEE Access, 12, 161595–161619.
    [10] Khan, F. A., Zeb, K., Al-Rakhami, M., Derhab, A., & Bukhari, S. A. C. (2021). Detection and prediction of diabetes using data mining: A comprehensive review. IEEE Access, 9, 43711–43735.
    [11] Ayon, S., & Islam, M. (2019). Diabetes prediction: A deep learning approach. International Journal of Information Engineering and Electronic Business, 11, 21–27.
    [12] Shetty, D., Rit, K., Shaikh, S., & Patil, N. (2017). Diabetes disease prediction using data mining. In ICIIECS 2017 International Conference, 1–5.
    [13] Santhanam, T., & Padmavathi, M. S. (2015). Application of K-Means and genetic algorithms for dimension reduction by integrating SNM for diabetes diagnosis. Procedia Computer Science, 47, 76–83.
    [14] Sisodia, D., & Sisodia, D. S. (2018). Prediction of diabetes using classification algorithms. Procedia Computer Science, 132, 1578–1585.
    [15] Perveen, S., Shahbaz, M., Guergachi, A., & Keshavjee, K. (2016). Performance analysis of data mining classification techniques to predict diabetes. Procedia Computer Science, 82, 115–121.
    [16] Nai-Arun, N., & Sittidech, P. (2014). Ensemble learning model for diabetes classification. Advanced Materials Research, 931–932, 1427–1431.
    [17] Orabi, K. M., Kamal, Y. M., & Rabah, T. M. (2016). Early predictive system for diabetes mellitus disease. In Industrial Conference on Data Mining. Springer.
    [18] Priyam, A., Gupta, R., Rathee, A., & Srivastava, S. (2013). Comparative analysis of decision tree classification algorithms. International Journal of Current Engineering and Technology, 3, 334–337.
    [19] Esposito, F., Malerba, D., Semeraro, G., & Kay, J. (1997). A comparative analysis of methods for pruning decision trees. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19, 476–491.
    [20] Kayaer, K., & Tulay, M. (2003). Medical diagnosis on Pima Indian diabetes using general regression neural networks. In ICANN/ICONIP, 181–184.
    [21] Shivakumar, B. L., & Alby, S. (2014). A survey on data-mining technologies for prediction and diagnosis of diabetes. In International Conference on Intelligent Computing Applications, 167–173.
    [22] Kavakiotis, I., Tsave, O., Salifoglou, A., Maglaveras, N., Vlahavas, I., & Chouvarda, I. (2017). Machine learning and data mining methods in diabetes research. Computational and Structural Biotechnology Journal, 15, 104–116.
    [23] Rajalakshmi, K., & Dhenakaran, D. S. S. (2015). Analysis of data mining prediction techniques in healthcare management system. IJARCSSE, 5(4), 1343–1347.
    [24] Visalatchi, G., Gnanasoundhari, S. J., & Balamurugan, M. (2014). A survey on data mining methods and techniques for diabetes mellitus. IJCSMA, 2(2), 100–105.
    [25] Tomar, D., & Agarwal, S. (2013). A survey on data mining approaches for healthcare. International Journal of Bio-Science and Bio-Technology, 5(5), 241–266.
    [26] Marinov, M., Mosa, A. S. M., Yoo, I., & Boren, S. A. (2011). Data-mining technologies for diabetes: A systematic review. Journal of Diabetes Science and Technology, 5(6), 1549–1556.
    [27] Tripathi, G., & Kumar, R. (2020). Early prediction of diabetes mellitus using machine learning. In 8th ICRITO Conference, 1009–1014.
    [28] Jian, Y., Pasquier, M., Sagahyroon, A., & Aloul, F. (2021). A machine learning approach to predicting diabetes complications. Healthcare, 9(12), 1712.
    [29] Pavani, K., Anjaiah, P., Krishna Rao, N. V., Deepthi, Y., Noel, D., & Lokesh, V. (2020). Diabetes prediction using machine learning techniques: A comparative analysis. In Energy Systems, Drives and Automations, 419–428.
    [30] Schölkopf, B. (2022). Causality for machine learning. In Probabilistic and Causal Inference: The Works of Judea Pearl, ACM, 765–804.
    [31] Sisodia, D., Singh, L., & Sisodia, S. (2012). Fast and accurate face recognition using SVM and DCT. In SocProS 2012, Springer, 1027–1038.
    [32] Greeshma, U., & Annalakshmi, S. (2015). Artificial neural network: Research paper on basics of ANN. International Journal of Scientific and Engineering Research, 110–115.
    [33] Kannadasan, K., Edla, D. R., & Kuppili, V. (2019). Type-2 diabetes data classification using stacked autoencoders. Clinical Epidemiology and Global Health, 7(4), 530–535.
    [34] Ali, J., Khan, R., Ahmad, N., & Maqsood, I. Random Forest and Decision Trees.
    [35] Kirasich, K., Smith, T., & Sader, B. (2018). Random forest vs. logistic regression: Binary classification for heterogeneous datasets. SMU Data Science Review, 1(3), 9.
    [36] UIU Datasets Repository System. (2025). https://data.uiu.ac.bd/dataset/iriic/1.

Download PDF