Optimization of Machine Learning Model using Grid and Random Search Algorithms for Predicting Student Dropout
DOI:
https://doi.org/10.52436/1.jutif.2026.7.3.5627Keywords:
Ada Boost, Grid Search, Machine Learning, Student Dropout, Student Performance, XGBoostAbstract
Student dropout is a serious problem that can affect the quality of education and operational efficiency of higher education institutions. Early prediction of potential students who will dropout is essential to develop appropriate intervention strategies, so as to increase graduation rates and reduce the negative impact on academic continuity. A better model for student dropout prediction becomes an objective of this research. The method used in this research is to improve the performance of machine learning models through the selection of optimal hyperparameters. The research methodology consists of several stages, including data preprocessing, handling imbalanced data, model training, and performance evaluation. There are three machine learning models used in this research, namely XGBoost, AdaBoost, and Random Forest. The selection of optimal hyperparameter values is carried out using the Random Search and Grid Search methods. Model evaluation is conducted using k-fold cross-validation and multiple evaluation metrics, including accuracy, precision, recall, and F1-score. As part of the important results, the combination of XGBoost and Random Search produced the best performance with 91.18% accuracy, indicating that hyperparameter optimization significantly improves predictive performance. The findings of this research explicitly contribute to the field of informatics, particularly educational data mining, and provide insights for educational institutions to identify high-risk dropout students more accurately.
Downloads
References
D. Baneres, M. E. Rodriguez-Gonzalez, and A. E. Guerrero-Roldan, “A Real-Time Predictive Model for Identifying Course Dropout in Online Higher Education,” IEEE Transactions on Learning Technologies, vol. 16, no. 4, pp. 484–499, Aug. 2023, doi: 10.1109/TLT.2023.3267275.
N. F. B. Casillano and K. W. Cantilang, “Employing educational data mining techniques to predict programming students at-risk of dropping out,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 35, no. 2, pp. 1219–1226, Aug. 2024, doi: 10.11591/ijeecs.v35.i2.pp1219-1226.
Z. Alj and M. O. C. Malki, “Predicting Students Drop-out in MOOC from Learning Behavior Using Machine Learning,” Journal of Uncertain Systems, vol. 16, no. 3, Sep. 2023, doi: 10.1142/S1752890922500118.
C. G. De Jesus and M. K. C. Ledda, “Intervention Support Program for Students at Risk of Dropping out Using Fuzzy Logic-Based Prescriptive Analytics,” in Proceeding - 2021 IEEE 17th International Colloquium on Signal Processing and Its Applications, CSPA 2021, 2021. doi: 10.1109/CSPA52141.2021.9377304.
Nurmalitasari, Z. Awang Long, and M. Faizuddin Mohd Noor, “Factors Influencing Dropout Students in Higher Education,” Educ Res Int, vol. 2023, 2023, doi: 10.1155/2023/7704142.
F. F. Ananna, R. Nowreen, S. S. R. Al Jahwari, E. A. Costa, L. Angeline, and S. R. Sindiramutty, “Analysing Influential Factors in Student Academic Achievement: Prediction Modelling and Insight,” International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence, vol. 2, no. 1, 2023, doi: 10.54938/ijemdcsai.2023.02.1.254.
A. T. Rahman and A. Raihan, “Prediction of Student Performance in Higher Academy Through Different Data Mining Techniques,” in 4th International Conference on Electrical, Computer and Telecommunication Engineering, ICECTE 2022, 2022. doi: 10.1109/ICECTE57896.2022.10114531.
E. E. Estandarte, D. P. Agbon, A. C. Azares, M. C. Elape, J. C. M. Bustillo, and J. R. Delos Arcos, “Prediction-based techniques on Academic Performance of Graduating Students through Machine Learning,” in 7th International Conference on Inventive Computation Technologies, ICICT 2024, Institute of Electrical and Electronics Engineers Inc., 2024, pp. 281–287. doi: 10.1109/ICICT60155.2024.10544915.
S. Tosun and D. Bakan Kalaycıoğlu, “Data mining approach for prediction of academic success in open and distance education,” Journal of Educational Technology and Online Learning, vol. 7, no. 2, pp. 168–176, May 2024, doi: 10.31681/jetol.1334687.
A. Wahdan, S. Hantoobi, M. Al-Emran, and K. Shaalan, “Early Detecting Students at Risk Using Machine Learning Predictive Models,” in Lecture Notes in Networks and Systems, 2022. doi: 10.1007/978-3-030-85990-9_27.
V. Christou et al., “Performance and early drop prediction for higher education students using machine learning,” Expert Syst Appl, vol. 225, Sep. 2023, doi: 10.1016/j.eswa.2023.120079.
S. A. Alwarthan, N. Aslam, and I. U. Khan, “Predicting Student Academic Performance at Higher Education Using Data Mining: A Systematic Review,” 2022, Hindawi Limited. doi: 10.1155/2022/8924028.
D. Andrade-Girón et al., “Predicting Student Dropout based on Machine Learning and Deep Learning: A Systematic Review,” EAI Endorsed Transactions on Scalable Information Systems, vol. 10, no. 5, 2023, doi: 10.4108/eetsis.3586.
I. Issah, O. Appiah, P. Appiahene, and F. Inusah, “A systematic review of the literature on machine learning application of determining the attributes influencing academic performance,” 2023. doi: 10.1016/j.dajour.2023.100204.
L. Kemper, G. Vorhoff, and B. U. Wigger, “Predicting student dropout: A machine learning approach,” European Journal of Higher Education, vol. 10, no. 1, 2020, doi: 10.1080/21568235.2020.1718520.
J. Kabathova and M. Drlik, “Towards predicting student’s dropout in university courses using different machine learning techniques,” Applied Sciences (Switzerland), vol. 11, no. 7, Apr. 2021, doi: 10.3390/app11073130.
N. Mduma, K. Kalegele, and D. Machuve, “A survey of machine learning approaches and techniques for student dropout prediction,” 2019. doi: 10.5334/dsj-2019-014.
S. S. Rezk and K. S. Selim, “Comparing nine machine learning classifiers for school-dropouts using a revised performance measure,” J Comput Soc Sci, Oct. 2024, doi: 10.1007/s42001-024-00281-8.
E. S. Karabacak and Y. Yaslan, “Comparison of Machine Learning Methods for Early Detection of Student Dropouts,” in UBMK 2023 - Proceedings: 8th International Conference on Computer Science and Engineering, Institute of Electrical and Electronics Engineers Inc., 2023, pp. 376–381. doi: 10.1109/UBMK59864.2023.10286747.
S. W. Masood, M. Gogoi, and S. A. Begum, “Optimised SMOTE-based Imbalanced Learning for Student Dropout Prediction,” Arab J Sci Eng, 2024, doi: 10.1007/s13369-024-09287-w.
A. López-García, O. Blasco-Blasco, M. Liern-García, and S. E. Parada-Rico, “Early detection of students’ failure using Machine Learning techniques,” Operations Research Perspectives, vol. 11, 2023, doi: 10.1016/j.orp.2023.100292.
Z. Alj, A. Bouayad, and O. Cherkaoui Malki Mohammed, “Intelligent System for Personalised Interventions and Early Drop-out Prediction in MOOCs,” 2022. [Online]. Available: www.ijacsa.thesai.org
W. Sabbir, Abdullah-Al-Kafi, A. S. M Afridi, M. S. Rahman, and M. Karmakar, “Improving Predictive Analytics for Student Dropout: A Comprehensive Analysis and Model Evaluation,” 11th International Conference on Computing for Sustainable Global Development (INDIACom), Feb. 2024.
J. Niyogisubizo, L. Liao, E. Nziyumva, E. Murwanashyaka, and P. C. Nshimyumukiza, “Predicting student’s dropout in university classes using two-layer ensemble machine learning approach: A novel stacked generalization,” Computers and Education: Artificial Intelligence, vol. 3, Jan. 2022, doi: 10.1016/j.caeai.2022.100066.
K. Roy and D. M. Farid, “An Adaptive Feature Selection Algorithm for Student Performance Prediction,” IEEE Access, vol. 12, pp. 75577–75598, 2024, doi: 10.1109/ACCESS.2024.3406252.
P. Berka and L. Marek, “Bachelor’s degree student dropouts: Who tend to stay and who tend to leave?,” Studies in Educational Evaluation, vol. 70, 2021, doi: 10.1016/j.stueduc.2021.100999.
Alamsyah, B. Prasetiyo, M. F. Al Hakim, and F. D. Pradana, “The improvement of COVID-19 prediction accuracy using optimal parameters in reccurent neural network model,” in AIP Conference Proceedings, Semarang, 2023, p. 040019. doi: 10.1063/5.0125767.
J. Li, Q. Zhu, Q. Wu, and Z. Fan, “A novel oversampling technique for class-imbalanced learning based on SMOTE and natural neighbors,” Inf Sci (N Y), vol. 565, no. 1, pp. 438–455, 2021, doi: 10.1016/j.ins.2021.03.041.
B. Prasetiyo, Alamsyah, M. A. Muslim, and N. Baroroh, “Evaluation performance recall and F2 score of credit card fraud detection unbalanced dataset using SMOTE oversampling technique,” J Phys Conf Ser, vol. 1918, no. 4, p. 42002, Jun. 2021, doi: 10.1088/1742-6596/1918/4/042002.
G. A. Pradipta, R. Wardoyo, A. Musdholifah, I. N. H. Sanjaya, and M. Ismail, “SMOTE for Handling Imbalanced Data Problem : A Review,” in 2021 6th International Conference on Informatics and Computing, ICIC 2021, 2021. doi: 10.1109/ICIC54025.2021.9632912.
Q. Wang et al., “XGBoost algorithm assisted multi-component quantitative analysis with Raman spectroscopy,” Spectrochim Acta A Mol Biomol Spectrosc, vol. 323, p. 124917, Dec. 2024, doi: 10.1016/j.saa.2024.124917.
M. Ma et al., “XGBoost-based method for flash flood risk assessment,” J Hydrol (Amst), vol. 598, p. 126382, 2021, doi: 10.1016/j.jhydrol.2021.126382.
J. Wu, Y. Li, and Y. Ma, “Comparison of XGBoost and the Neural Network model on the class-balanced datasets,” in 2021 IEEE 3rd International Conference on Frontiers Technology of Information and Computer, ICFTIC 2021, 2021, pp. 457–461. doi: 10.1109/ICFTIC54370.2021.9647373.
C. Tan, F. Yin, and T. Jiang, “Classification method of seabed sonar image substrate based on ELM-AdaBoost,” J Radiat Res Appl Sci, p. 101126, Sep. 2024, doi: 10.1016/j.jrras.2024.101126.
J. Bian, J. Wang, and Q. Yece, “A novel study on power consumption of an HVAC system using CatBoost and AdaBoost algorithms combined with the metaheuristic algorithms,” Energy, vol. 302, p. 131841, Dec. 2024, doi: 10.1016/j.energy.2024.131841.
C. Rao, M. Li, T. Huang, and F. Li, “Stroke Risk Assessment Decision-Making Using a Machine Learning Model: Logistic-AdaBoost,” CMES - Computer Modeling in Engineering and Sciences, vol. 139, no. 1, pp. 699–724, 2023, doi: 10.32604/cmes.2023.044898.
L. Hao and G. Huang, “An improved AdaBoost algorithm for identification of lung cancer based on electronic nose,” Heliyon, vol. 9, no. 3, pp. 1–14, 2023, doi: 10.1016/j.heliyon.2023.e13633.
M. F. Al Hakim, S. Subhan, and P. Listiaji, “SOCIAL MEDIA COMMENTS FOR GOVERNMENT INSTITUTION VIDEO CLASSIFICATION USING MACHINE LEARNING,” JITK (Jurnal Ilmu Pengetahuan dan Teknologi Komputer), vol. 10, no. 2, pp. 433–440, Nov. 2024, doi: 10.33480/jitk.v10i2.5187.
M. F. Al Hakim, M. N. Tiadah, and S. Subhan, “Diabetes Diagnosis Using Machine Learning Model with Feature Selection,” in 2025 International Conference on Smart Computing, IoT and Machine Learning (SIML), Surakarta: IEEE Access, Jun. 2025, pp. 1–6. doi: 10.1109/SIML65326.2025.11080810.
J. Magidi, L. Nhamo, S. Mpandeli, and T. Mabhaudhi, “Application of the random forest classifier to map irrigated areas using google earth engine,” Remote Sens (Basel), vol. 13, no. 5, p. 876, 2021, doi: 10.3390/rs13050876.
G. Garg and R. Garg, “Brain Tumor Detection and Classification based on Hybrid Ensemble Classifier,” pp. 1–18, 2021, doi: 10.48550/arXiv.2101.00216.
X. Chai, S. Li, and F. Liang, “A novel battery SOC estimation method based on random search optimized LSTM neural network,” Energy, vol. 306, p. 132583, Dec. 2024, doi: 10.1016/j.energy.2024.132583.
F. Hosseini, C. Prieto, and C. Álvarez, “Hyperparameter optimization of regional hydrological LSTMs by random search: A case study from Basque Country, Spain,” J Hydrol (Amst), vol. 643, p. 132003, Nov. 2024, doi: 10.1016/j.jhydrol.2024.132003.
S. A. Kawa and M. Arifwani, “Randomized search on a grid of CNN networks with simplified search space,” in Proceedings of the 2021 8th International Conference on Computing for Sustainable Global Development, INDIACom 2021, 2021, pp. 68–73. doi: 10.1109/INDIACom51348.2021.00014.
F. S. Gharehchopogh, M. Namazi, L. Ebrahimi, and B. Abdollahzadeh, “Advances in Sparrow Search Algorithm: A Comprehensive Survey,” Archives of Computational Methods in Engineering, vol. 30, no. 1, pp. 427–455, 2023, doi: 10.1007/s11831-022-09804-w.
D. Marinov and D. Karapetyan, “Hyperparameter optimisation with early termination of poor performers,” in 2019 11th Computer Science and Electronic Engineering Conference, CEEC 2019 - Proceedings, 2019, pp. 160–163. doi: 10.1109/CEEC47804.2019.8974317.
H. Alibrahim and S. A. Ludwig, “Hyperparameter Optimization: Comparing Genetic Algorithm against Grid Search and Bayesian Optimization,” in 2021 IEEE Congress on Evolutionary Computation, CEC 2021 - Proceedings, 2021, pp. 1551–1559. doi: 10.1109/CEC45853.2021.9504761.
N. Zhu, C. Zhu, L. Zhou, Y. Zhu, and X. Zhang, “Optimization of the Random Forest Hyperparameters for Power Industrial Control Systems Intrusion Detection Using an Improved Grid Search Algorithm,” Applied Sciences (Switzerland), vol. 12, no. 20, p. 10456, 2022, doi: 10.3390/app122010456.
I. Priyadarshini and C. Cotton, “A novel LSTM–CNN–grid search-based deep neural network for sentiment analysis,” Journal of Supercomputing, vol. 77, no. 12, pp. 13911–13932, 2021, doi: 10.1007/s11227-021-03838-w.
S. Gowriswari and S. Brindha, “Hyperparameters Optimization using Gridsearch Cross Validation Method for machine learning models in Predicting Diabetes Mellitus Risk,” in 2022 International Conference on Communication, Computing and Internet of Things, IC3IoT 2022 - Proceedings, 2022, pp. 1–4. doi: 10.1109/IC3IOT53935.2022.9768005.
D. A. Anggoro and S. S. Mukti, “Performance Comparison of Grid Search and Random Search Methods for Hyperparameter Tuning in Extreme Gradient Boosting Algorithm to Predict Chronic Kidney Failure,” International Journal of Intelligent Engineering and Systems, vol. 14, no. 6, pp. 198–207, 2021, doi: 10.22266/ijies2021.1231.19.
M. K. Sharma and M. Yadav, “Predicting Students’ Drop-Out Rate Using Machine Learning Models: A Comparative Study,” in Proceedings of the 2022 3rd International Conference on Intelligent Computing, Instrumentation and Control Technologies: Computational Intelligence for Smart Systems, ICICICT 2022, Institute of Electrical and Electronics Engineers Inc., 2022, pp. 1166–1171. doi: 10.1109/ICICICT54557.2022.9917841.
S. Mohammad, I. A. Chowdhury, N. Roy, Md. N. Hasan, and D. Nandi, “Investigation of Student Dropout Problem by Using Data Mining Technique,” International Journal of Education and Management Engineering, vol. 13, no. 5, pp. 43–61, Oct. 2023, doi: 10.5815/ijeme.2023.05.04.
P. Padmasiri and S. Kasthuriarachchi, “Interpretable Prediction of Student Dropout Using Explainable AI Models,” in Proceedings - International Research Conference on Smart Computing and Systems Engineering, SCSE 2024, Institute of Electrical and Electronics Engineers Inc., 2024. doi: 10.1109/SCSE61872.2024.10550525.
Additional Files
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 M. Faris Al Hakim, Siti Wahyuni, Kholiq Budiman, Aditya Marianti, Bambang Eko Susilo, Nuni Widiarti, Sri Sukaesih, Rifaatunnisa

This work is licensed under a Creative Commons Attribution 4.0 International License.





