Enhancing Chronic Kidney Disease Classification Using Decision Tree And Bootstrap Aggregating: Uci Dataset Study With Improved Accuracy And Auc-Roc

Authors

  • Zuriati Department of Internet Engineering Technology, Politeknik Negeri Lampung, Indonesia
  • Dian Meilantika Department of Information Management, Politeknik Negeri Lampung, Indonesia
  • Atika Arpan Department of Information Management, Politeknik Negeri Lampung, Indonesia
  • Rizka Permata Department of Information Management, Politeknik Negeri Lampung, Indonesia
  • Sriyanto Department of Informatics, Institute of Informatics and Business Darmajaya, Indonesia
  • Mohd. Zaki Mas'ud Faculty of Artificial Intelligence and Cyber Security, Universiti Teknikal Malaysia Melaka, Malaysia

DOI:

https://doi.org/10.52436/1.jutif.2025.6.5.5271

Keywords:

Bootstrap Aggregating, Chronic Kidney Disease, Classification, Decision Tree, Ensemble Learning

Abstract

Chronic Kidney Disease (CKD) is a progressive medical disorder that requires timely and precise identification to avoid permanent impairment of kidney function. However, Decision Tree models, although widely used in clinical applications due to their transparency, ease of implementation, and ability to handle both categorical and numerical data, are prone to overfitting and instability when applied to small or imbalanced datasets. The purpose of this study is to optimize CKD classification by integrating Bootstrap Aggregating (Bagging) with Decision Tree to enhance accuracy and robustness. The methodology involves testing two model variants a standalone Decision Tree and a Bagging-supported Decision Tree using 10-fold cross-validation and evaluating performance with accuracy, precision, recall, F1-score, and the area under the ROC curve (AUC-ROC). Findings reveal that Bagging enhances model accuracy from 0.980 to 0.987, raises precision from 0.976 to 1.000, and improves recall from 0.954 to 0.954, and increases F1-score from 0.965 to 0.976. These results demonstrate that Bagging significantly improves the reliability and generalizability of Decision Tree classifiers, making them more effective for CKD prediction.

Downloads

Download data is not yet available.

References

E. Cavalier, T. Zima, P. Datta, and K. Makris, “Recommendations for European laboratories based on the KDIGO 2024 Clinical Practice Guideline for the Evaluation and Management of Chronic Kidney Disease,” Clinical Chemistry and Laboratory Medicine (CCLM), vol. 63, no. 3, pp. 525–534, Feb. 2025, doi: 10.1515/cclm-2024-1082.

C. Ke, J. Liang, M. Liu, S. Liu, and C. Wang, “Burden of chronic kidney disease and its risk-attributable burden in 137 low-and middle-income countries, 1990–2019: results from the global burden of disease study 2019,” BMC Nephrology, vol. 23, no. 1, pp. 1–12, 2022, doi: 10.1186/s12882-021-02597-3.

J. Guo, Z. Liu, P. Wang, and H. Wu, “Global, regional, and national burden inequality of chronic kidney disease, 1990–2021: a systematic analysis for the global burden of disease study 2021,” Frontiers in Medicine, vol. 11, no. January, pp. 1–25, Jan. 2025, doi: 10.3389/fmed.2024.1501175.

B. Bikbov, C. A. Purcell, A. S. Levey, and M. Smith, “Global, regional, and national burden of chronic kidney disease, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017,” The Lancet, vol. 395, no. 10225, pp. 709–733, Feb. 2020, doi: 10.1016/S0140-6736(20)30045-3.

Y. Lai, H. Long, Z. Liang, and C. Wu, “Global burden and risk factors of chronic kidney disease due to hypertension in adults aged 20 plus years, 1990–2021,” Frontiers in Public Health, vol. 13, no. May, pp. 1–12, May 2025, doi: 10.3389/fpubh.2025.1503837.

X. Wang, Z. Liu, N. Yi, and L. Li, “The global burden of chronic kidney disease due to glomerulonephritis: trends and predictions,” International Urology and Nephrology, Mar. 2025, doi: 10.1007/s11255-025-04440-2.

A. Makmun, B. Satirapoj, D. G. Tuyen, and M. W. Y. Foo, “The burden of chronic kidney disease in Asia region: a review of the evidence, current challenges, and future directions.,” Kidney research and clinical practice, vol. 44, no. 3, pp. 411–433, 2024, doi: 10.23876/j.krcp.23.194.

N. M. Hustrini, E. Susalit, F. F. Widjaja, and A. I. Khumaedi, “The Etiology of Advanced Chronic Kidney Disease in Southeast Asia: A Meta-analysis,” Journal of Epidemiology and Global Health, vol. 14, no. 3, pp. 740–764, 2024, doi: 10.1007/s44197-024-00209-5.

P. Mahajan, S. Uddin, F. Hajati, and M. A. Moni, “Ensemble Learning for Disease Prediction: A Review,” Healthcare, vol. 11, no. 12, p. 1808, Jun. 2023, doi: 10.3390/healthcare11121808.

F. Keshvari-Shad, S. Hajebrahimi, and M. P. Laguna Pes, “A Systematic Review of Screening Tests for Chronic Kidney Disease: An Accuracy Analysis,” Galen Medical Journal, vol. 9, p. e1573, 2020, doi: 10.31661/gmj.v9i0.1573.

B. Zhou, J. Zhang, and G. Li, “The Global, Regional, and National Burden of Pancreatitis From 1990 to 2021: A Systematic Analysis for the Global Burden of Disease Study 2021,” Journal of Gastroenterology and Hepatology, vol. 40, no. 5, pp. 1297–1306, May 2025, doi: 10.1111/jgh.16906.

S. R. Kalupukuru and K. Natarajan, “Machine Learning Methods for Predicting the Prognosis of Chronic Kidney Disease,” Procedia Computer Science, vol. 258, pp. 1372–1382, 2025, doi: 10.1016/j.procs.2025.04.370.

Z. Ullah and M. Jamjoom, “Early Detection and Diagnosis of Chronic Kidney Disease Based on,” Journal of Healthcare Engineering, vol. 2023, 2023.

K. Mendapara, “Development and evaluation of a chronic kidney disease risk prediction model using random forest,” Frontiers in Genetics, vol. 15, no. June, pp. 1–11, Jun. 2024, doi: 10.3389/fgene.2024.1409755.

M. Garonga and Rita Tanduk, “Comparison of Naive Bayes, Decision Tree, and Random Forest Algorithms in Classifying Learning Styles of Universitas Kristen Indonesia Toraja Students,” Jurnal Teknik Informatika (Jutif), vol. 4, no. 6, pp. 1507–1514, 2023, doi: 10.52436/1.jutif.2023.4.6.1020.

M. K. Biddinika, A. Masitha, H. Herman, and V. A. N. Fatimah, “Machine Learning Techniques for Heart Disease Prediction Using a Multi-Algorithm Approach,” JUITA: Jurnal Informatika, vol. 12, no. 2, p. 149, 2024, doi: 10.30595/juita.v12i2.24153.

G. Blanc, J. Lange, A. Malik, and L. Y. Tan, “Popular decision tree algorithms are provably noise tolerant,” Proceedings of Machine Learning Research, vol. 162, pp. 2091–2106, 2022, doi: https://doi.org/10.48550/arXiv.2206.08899.

M. Alizade-harakiyan, A. Khodaei, A. Yousefi, H. Zamani, and A. Mesbahi, “Decision tree-based machine learning algorithm for prediction of acute radiation esophagitis,” vol. 42, no. January, 2025, [Online]. Available: https://doi.org/10.1016/j.bbrep.2025.101991

D. Bertsimas and V. Digalakis, “Improving Stability in Decision Tree Models,” pp. 1–37, 2023, [Online]. Available: http://arxiv.org/abs/2305.17299

M. M. M. Hassan, M. M. M. Hassan, S. Mollick, and M. A. R. Khan, “A Comparative Study, Prediction and Development of Chronic Kidney Disease Using Machine Learning on Patients Clinical Records,” Human-Centric Intelligent Systems, vol. 3, no. 2, pp. 92–104, 2023, doi: 10.1007/s44230-023-00017-3.

S. Y. Irianto, D. Linda, I. I. M. Rizki, S. Karnila, and D. Yuliawati, “Chronic Kidney Disease Classification Using Bagging and Particle Swarm Optimization Techniques,” International Journal of Advanced Computer Science and Applications, vol. 16, no. 3, pp. 689–698, 2025, doi: 10.14569/IJACSA.2025.0160368.

P. A. Moreno-Sánchez, “Chronic Kidney Disease Early Diagnosis Enhancing by using Data Mining Classification and Features Selection,” pp. 460–467, 2021, doi: https://doi.org/10.1007/978-3-030-69963-5_5.

P. A. Moreno-Sanchez, “Data-Driven Early Diagnosis of Chronic Kidney Disease: Development and Evaluation of an Explainable AI Model,” IEEE Access, vol. 11, no. Ml, pp. 38359–38369, 2023, doi: 10.1109/ACCESS.2023.3264270.

S. Pal, “Chronic Kidney Disease Prediction Using Machine Learning Techniques,” Biomedical Materials and Devices, vol. 1, no. 1, pp. 534–540, 2023, doi: 10.1007/s44174-022-00027-y.

C. Kaur, M. S. Kumar, A. Anjum, M. B. Binda, M. R. Mallu, and M. S. Al Ansari, “Chronic Kidney Disease Prediction Using Machine Learning,” Journal of Advances in Information Technology, vol. 14, no. 2, pp. 384–391, 2023, doi: 10.12720/jait.14.2.384-391.

A. M. Elshewey, E. Selem, and A. H. Abed, “Improved CKD classification based on explainable artificial intelligence with extra trees and BBFS,” Scientific Reports, vol. 15, no. 1, p. 17861, May 2025, doi: 10.1038/s41598-025-02355-7.

D. Chhabra, M. Juneja, and G. Chutani, “An Efficient Ensemble-based Machine Learning approach for Predicting Chronic Kidney Disease,” Current Medical Imaging Reviews, vol. 20, pp. 1–12, 2023, doi: 10.2174/1573405620666230508104538.

D. Saif, A. M. Sarhan, and N. M. Elshennawy, “Early prediction of chronic kidney disease based on ensemble of deep learning models and optimizers,” Journal of Electrical Systems and Information Technology, vol. 11, no. 1, 2024, doi: 10.1186/s43067-024-00142-4.

S. Imran Ali, B. Ali, J. Hussain, M. Hussain, and F. A. Satti, “Cost-Sensitive Ensemble Feature Ranking and Automatic Threshold Selection for Chronic Kidney Disease Diagnosis,” Applied Sciences, vol. 10, no. 16, p. 5663, Aug. 2020, doi: 10.3390/app10165663.

M. H. I. Bijoy, M. J. Mia, M. M. Rahman, M. S. Arefin, P. K. Dhar, and T. Shimamura, A robot process automation based mobile application for early prediction of chronic kidney disease using machine learning, vol. 7, no. 6. Springer International Publishing, 2025. doi: 10.1007/s42452-025-06980-9.

D. A. Debal and T. M. Sitote, “Chronic kidney disease prediction using machine learning techniques,” Journal of Big Data, vol. 9, no. 1, 2022, doi: 10.1186/s40537-022-00657-5.

S. C. Tan, “Decision Tree Regression with Residual Outlier Detection,” Journal of Data Science and Intelligent Systems, vol. 3, no. 2, pp. 126–135, 2024, doi: 10.47852/bonviewjdsis42023861.

Additional Files

Published

2025-10-16

How to Cite

[1]
Z. Zuriati, D. Meilantika, A. Arpan, R. Permata, S. Sriyanto, and M. Z. . Mas’ud, “Enhancing Chronic Kidney Disease Classification Using Decision Tree And Bootstrap Aggregating: Uci Dataset Study With Improved Accuracy And Auc-Roc”, J. Tek. Inform. (JUTIF), vol. 6, no. 5, pp. 3111–3123, Oct. 2025.