Comparison of AdaBoost and Random Forest Methods in Osteoporosis Risk Prediction Based on Machine Learning

Authors

  • Edwardo Parlindungan H Computer Science Department, Universitas Dinamika Bangsa Jambi, Indonesia
  • Setiawan Assegaff Computer Science Department, Universitas Dinamika Bangsa Jambi, Indonesia
  • Jasmir Jasmir Computer Science Department, Universitas Dinamika Bangsa Jambi, Indonesia

DOI:

https://doi.org/10.52436/1.jutif.2026.7.1.5297

Keywords:

AdaBoost, Ensemble methods, Machine learning, Osteoporosis prediction, Random forest, Risk classification

Abstract

This study aims to determine the most effective ensemble machine learning algorithm for osteoporosis risk prediction in resource-constrained healthcare settings, specifically comparing AdaBoost and Random Forest performance on Southeast Asian population data. We implemented nested 5-fold cross-validation on a dataset of 1,958 records with 15 lifestyle and demographic attributes. Both algorithms underwent hyperparameter optimization, and performance was evaluated using accuracy, precision, recall, F1-score, and clinical utility metrics including cost-effectiveness analysis. AdaBoost achieved superior performance with 86.90% accuracy (95% CI: 84.2-89.6%) and perfect precision (1.00) compared to Random Forest's 84.69% accuracy and 0.92 precision. Statistical significance testing confirmed AdaBoost's advantage (p=0.032). Clinical implementation in three health centers demonstrated 60% reduction in unnecessary referrals. This is the first study to compare these algorithms specifically for Southeast Asian populations with clinical validation and cost-effectiveness analysis, providing a ready-to-deploy model for resource-limited healthcare settings.

Downloads

Download data is not yet available.

References

M. Linda, T. Alfianti, and R. Supriyanto, “Perbandingan Kinerja Algoritma Random Forest, AdaBoost, dan XGBoost dalam Memprediksi Risiko Penyakit Osteoporosis,” J. Appl. Data Sci., vol. 4, no. 2, pp. 112–125, 2024.

W. H. Organization, “Assessment of fracture risk and its application to screening for postmenopausal osteoporosis,” WHO Tech. Rep. Ser., vol. 843, pp. 1–129, 2020.

Humaryanto, “Deteksi Dini Osteoporosis Pasca Menopause,” Indones. J. Med. Res., vol. 15, no. 3, pp. 45–52, 2023.

R. R. Pratama, “Analisis Model Machine Learning Terhadap Pengenalan Aktifitas Manusia,” MATRIK J. Manajemen, Tek. Inform. dan Rekayasa Komput., vol. 9, no. 2, pp. 302–311, 2020.

N. Sharfina and N. Ghaniaviyanto, “Analisis SMOTE Pada Klasifikasi Hepatitis C Berbasis Random Forest dan Naïve Bayes,” J. Inf. Technol. Comput. Sci., vol. 6, no. 3, pp. 167–175, 2022.

E. J. Topol, “High-performance medicine: the convergence of human and artificial intelligence,” Nat. Med., vol. 25, no. 1, pp. 44–56, 2019.

T. Z. Jasman, M. A. Fadhlullah, A. L. Pratama, and R. Rismayani, “Analisis Algoritma Gradient Boosting, AdaBoost dan Catboost dalam Klasifikasi Kualitas Air,” J. Tek. Inform. dan Sist. Inf., vol. 8, no. 2, pp. 215–224, 2022.

A. Kulkarni, “Lifestyle Factors Influencing Osteoporosis Dataset,” Kaggle, 2024. https://www.kaggle.com/datasets/amitvkulkarni/lifestyle-factors-influencing-osteoporosis.

S. Kasau, D. M. Togubu, H. Hasmah, M. I. Nur, R. Rosdiana, and S. Sandrawati, “Model Klasifikasi Kanker Payudara Berdasarkan Status Keganasan Menggunakan Algoritma Random Forest dan Decision Tree,” semanTIK, vol. 8, no. 2, pp. 167–175, 2022.

L. Breiman, “Random forests,” Mach. Learn., vol. 45, no. 1, pp. 5–32, 2001.

M. Syukron, R. Santoso, and T. Widiharih, “Perbandingan Metode SMOTE Random Forest dan SMOTE XGBoost untuk Klasifikasi Tingkat Penyakit Hepatitis C pada Imbalance Class Data,” Gaussian J., vol. 11, no. 3, pp. 455–464, 2022.

I. Imanuddin, F. Alhadi, R. Oktafian, and A. Ihsan, “Deteksi Mata Mengantuk pada Pengemudi Mobil Menggunakan Metode Viola Jones,” MATRIK J. Manajemen, Tek. Inform. dan Rekayasa Komput., vol. 18, no. 2, pp. 321–329, 2019.

D. P. Sinambela, H. Naparin, M. Zulfadhilah, and N. Hidayah, “Implementasi Algoritma Decision Tree dan Random Forest dalam Prediksi Perdarahan Pascasalin,” J. Inf. dan Teknol., vol. 5, no. 3, pp. 58–64, 2023.

R. Artanto, W. Sujana, I. Made, and A. Agastya, “Application of Machine Learning Algorithm for Osteoporosis Disease Prediction System,” J. Artif. Intell. Comput. Sci., vol. 5, no. 1, pp. 78–87, 2024.

R. Irfannandhy, L. B. Handoko, and N. Ariyanto, “Analisis Performa Model Random Forest dan CatBoost dengan Teknik SMOTE dalam Prediksi Risiko Diabetes,” Edumatic J. Pendidik. Inform., vol. 8, no. 2, pp. 714–723, 2024.

N. F. Sahamony, T. Terttiaavini, and H. Rianto, “Analisis Perbandingan Kinerja Model Machine Learning untuk Memprediksi Risiko Stunting pada Pertumbuhan Anak,” MALCOM Indones. J. Mach. Learn. Comput. Sci., vol. 4, no. 2, pp. 413–422, 2024.

W. Apriliah, I. Kurniawan, M. Baydhowi, and T. Haryati, “Prediksi Kemungkinan Diabetes pada Tahap Awal Menggunakan Algoritma Klasifikasi Random Forest,” Sist. J. Sist. Inf., vol. 9, no. 3, pp. 504–516, 2020.

N. Ghaniaviyanto, “Analisis SMOTE Pada Klasifikasi Hepatitis C Berbasis Random Forest dan Naïve Bayes,” J. Inf. Technol. Comput. Sci., vol. 6, no. 3, pp. 167–175, 2022.

T. G. Dietterich, “Ensemble methods in machine learning,” Mult. Classif. Syst., vol. 1857, pp. 1–15, 2000.

R. Chairunisa and W. Astuti, “Perbandingan CART dan Random Forest untuk Deteksi Kanker berbasis Klasifikasi Data Microarray,” J. Teknol. Inf. dan Ilmu Komput., vol. 4, no. 2, pp. 112–120, 2017.

P. W. S. Aji, S. Suprianto, and R. Dijaya, “Prediksi Penyakit Stroke Menggunakan Metode Random Forest,” KESATRIA J. Penerapan Sist. Inf., vol. 4, no. 2, pp. 145–152, 2023.

Y. Freund and R. E. Schapire, “A decision-theoretic generalization of on-line learning and an application to boosting,” J. Comput. Syst. Sci., vol. 55, no. 1, pp. 119–139, 1997.

C. N. Prabiantissa, M. Kurniawan, and A. F. Bhahreisy, “Deteksi Mata dan Alis Menggunakan AdaBoost Classifier dan Haar Cascade,” J. FASILKOM, vol. 14, no. 3, pp. 705–714, 2024.

A. S. Munir and R. Waluyo, “Optimasi Prediksi Kematian pada Gagal Jantung: Analisis Perbandingan Algoritma Pembelajaran Ensemble dan Teknik Penyeimbangan Data,” J. Sist. dan Teknol. Inf., vol. 12, no. 2, pp. 365–378, 2024.

R. E. Schapire, “The strength of weak learnability,” Mach. Learn., vol. 5, no. 2, pp. 197–227, 1990.

S. Geman, E. Bienenstock, and R. Doursat, “Neural networks and the bias/variance dilemma,” Neural Comput., vol. 4, no. 1, pp. 1–58, 1992.

T. L. Beauchamp and J. F. Childress, Principles of Biomedical Ethics, 8th ed. New York: Oxford University Press, 2019.

P. Bossuyt, J. B. Reitsma, and D. E. Bruns, “STARD 2015: An updated list of essential items for reporting diagnostic accuracy studies,” BMJ, vol. 351, p. 5527, 2015.

J. H. Friedman, “Greedy function approximation: A gradient boosting machine," Annals of Statistics,” Ann. Stat., vol. 29, no. 5, pp. 1189–1232, 2001.

E. H. Shortliffe and J. J. Cimino, Biomedical Informatics: Computer Applications in Health Care and Biomedicine, 4th ed. London: Springer, 2014.

C. Roux, M. L. Brandi, and A. Diez-Perez, “Relationship between volumetric bone mineral density and microarchitecture in the distal radius and tibia by HR-pQCT in postmenopausal women,” Osteoporos. Int., vol. 24, no. 5, pp. 1609–1618, 2013.

D. G. Altman and J. M. Bland, “External validation of diagnostic tests,” Stat. Med., vol. 13, no. 12, pp. 1241–1249, 1994.

G. A. Colditz, “Overview of the epidemiology methods and applications: strengths and limitations of observational study designs,” Crit. Rev. Food Sci. Nutr., vol. 50, no. 1, pp. 10–12, 2010.

Additional Files

Published

2026-02-15

How to Cite

[1]
E. Parlindungan H, S. . Assegaff, and J. Jasmir, “Comparison of AdaBoost and Random Forest Methods in Osteoporosis Risk Prediction Based on Machine Learning”, J. Tek. Inform. (JUTIF), vol. 7, no. 1, pp. 201–209, Feb. 2026.