Random Forest and Artificial Neural Network Data Mining for Environmental and Public Health Risk Modeling in Flood-Prone Urban Areas of Indonesia

Authors

  • Deni Mahdiana Faculty of Information Technology, Universitas Budi Luhur, Indonesia
  • Masato Ebine Faculty of Risk and Crisis Management, Chiba Institute of Science, Japan
  • Arief Wibowo Faculty of Information Technology, Universitas Budi Luhur, Indonesia

DOI:

https://doi.org/10.52436/1.jutif.2025.6.6.5387

Keywords:

artificial neural networks, data mining, environmental health, flood risk prediction, public health, random forest

Abstract

Floods in urban Indonesia pose severe environmental and public health challenges, exacerbating water contamination, vector proliferation, and disease outbreaks. Rapid urbanization, inadequate drainage systems, and climate change have intensified these impacts, emphasizing the need for integrated predictive frameworks. This study aims to develop a Data Mining (DM)-based modeling approach that combines environmental and health indicators to predict flood-related disease risks. Random Forest (RF) and Artificial Neural Network (ANN) algorithms were applied to multi-domain datasets from 30 flood-prone urban sub-districts between 2018 and 2023, encompassing rainfall, drainage density, land use, and water quality variables, integrated with disease incidence data such as diarrhea, dengue, and leptospirosis. The ANN model achieved superior predictive performance (93% accuracy, AUC 0.93) compared to RF (90% accuracy, AUC 0.90), identifying rainfall intensity, drainage density, and coliform contamination as the most influential predictors. These results demonstrate the capability of AI-driven DM techniques to capture complex interdependencies between environmental and health systems. The developed framework contributes to the field of informatics by providing a scalable, data-driven early warning tool for flood-related health risks, supporting evidence-based decision-making in disaster risk management and enhancing public health resilience in rapidly urbanizing regions.

Downloads

Download data is not yet available.

References

S. Aziz, A. Hamid, A. Shaikh, R. Mansoor, and A. Owings, “Unveiling Disparities in Heart Failure and Cirrhosis Related Mortality: CDC WONDER 1999 to 2020.,” Dig Dis Sci, p., 2025, doi: 10.1007/s10620-025-08970-8.

N. Byaruhanga, D. Kibirige, S. Gokool, and G. Mkhonta, “Evolution of Flood Prediction and Forecasting Models for Flood Early Warning Systems: A Scoping Review,” Water (Basel), p., 2024, doi: 10.3390/w16131763.

M. E. Carias, D. Johnston, R. Knott, and R. Sweeney, “Flood disasters and health among the urban poor,” Health Econ, vol. 31, pp. 2072–2089, 2021, doi: 10.1002/hec.4566.

S. Crawford et al., “Remobilization of pollutants during extreme flood events poses severe risks to human and environmental health.,” J Hazard Mater, vol. 421, p. 126691, 2021, doi: 10.1016/j.jhazmat.2021.126691.

A. Das and S. Sahoo, “Impact of land use and climate change on urban flooding: a case study of Bhubaneswar city in India,” Natural Hazards, p., 2025, doi: 10.1007/s11069-025-07130-5.

A. Dubey, Z. Yang, and G. Hattab, “A nested model for AI design and validation,” iScience, vol. 27, p., 2024, doi: 10.1016/j.isci.2024.110603.

Y. Himeur, B. Rimal, A. Tiwary, and A. Amira, “Using artificial intelligence and data fusion for environmental monitoring: A review and future perspectives,” Inf. Fusion, vol. 86–87, pp. 44–75, 2022, doi: 10.1016/j.inffus.2022.06.003.

D. Lee, H. Ahmadul, J. Patz, and P. Block, “Predicting social and health vulnerability to floods in Bangladesh,” Natural Hazards and Earth System Sciences, p., 2020, doi: 10.5194/nhess-2020-392-supplement.

J. Liu, Q. Huang, C. Ulishney, and C. Dumitrescu, “Comparison of Random Forest and Neural Network in Modelling the Performance and Emissions of a Natural Gas Spark Ignition Engine,” J Energy Resour Technol, p., 2021, doi: 10.1115/1.4053301.

H. Meng, X. An, and J. Xing, “A data-driven Bayesian network model integrating physical knowledge for prioritization of risk influencing factors,” Process Safety and Environmental Protection, p., 2022, doi: 10.1016/j.psep.2022.02.010.

B. Pham et al., “Flood risk assessment using hybrid artificial intelligence models integrated with multi-criteria decision analysis in Quang Nam Province, Vietnam,” J Hydrol (Amst), p., 2021, doi: 10.1016/j.jhydrol.2020.125815.

Y. Qiao, Y. Wang, N. Jin, S. Zhang, F. Giustozzi, and Tao, “Assessing flood risk to urban road users based on rainfall scenario simulations,” Transp Res D Transp Environ, p., 2023, doi: 10.1016/j.trd.2023.103919.

P. Rajpurkar, E. Chen, O. Banerjee, and E. Topol, “AI in health and medicine,” Nat Med, vol. 28, pp. 31–38, 2022, doi: 10.1038/s41591-021-01614-0.

J. Rogers, M. Maneta, S. Sain, L. Madaus, and J. Hacker, “The role of climate and population change in global flood exposure and vulnerability,” Nat Commun, vol. 16, p., 2025, doi: 10.1038/s41467-025-56654-8.

M. Saaristo et al., “Spatial and Temporal Dynamics of Chemical and Microbial Contamination in Nonurban Floodwaters,” Environ Sci Technol, vol. 58, pp. 21411–21422, 2024, doi: 10.1021/acs.est.4c03875.

S. Vardoulakis et al., “Building resilience to Australian flood disasters in the face of climate change,” Med J Aust, vol. 217, pp. 342–345, 2022, doi: 10.5694/mja2.51595.

B. Tellman et al., “Satellite imaging reveals increased proportion of population exposed to floods,” Nature, vol. 596, pp. 80–86, 2021, doi: 10.1038/s41586-021-03695-w.

X. Wang, X. Yu, and X. Yu, “Flood Disaster Risk Assessment Based on DEA Model in Southeast Asia along ‘The Belt and Road,’” Sustainability, p., 2022, doi: 10.3390/su142013145.

F. Zhu, Y. Wang, J. Zhou, C. Chen, L. Li, and G. Liu, “A Unified Framework for Cross-Domain and Cross-System Recommendations,” IEEE Trans Knowl Data Eng, vol. 35, pp. 1171–1184, 2021, doi: 10.1109/TKDE.2021.3104873.

C. Ahmadi, A. Karampourian, and M. R. Samarghandi, “Explain the challenges of evacuation in floods based on the views of citizens and executive managers,” Heliyon, vol. 8, no. 9, p. e10759, 2022, doi: https://doi.org/10.1016/j.heliyon.2022.e10759.

M. K. Dal Barco et al., “Integrating AI and climate change scenarios for multi-risk assessment in the coastal municipalities of the Veneto region,” Science of The Total Environment, vol. 965, p. 178586, 2025, doi: https://doi.org/10.1016/j.scitotenv.2025.178586.

A. Kern et al., “Seamlessly combined historical and projected daily meteorological datasets for impact studies in Central Europe: The FORESEE v4.0 and the FORESEE-HUN v1.0,” Clim Serv, vol. 33, p. 100443, 2024, doi: https://doi.org/10.1016/j.cliser.2023.100443.

K. Vasileiou, J. Barnett, and D. S. Fraser, “Integrating local and scientific knowledge in disaster risk reduction: A systematic review of motivations, processes, and outcomes,” International Journal of Disaster Risk Reduction, vol. 81, p. 103255, 2022, doi: https://doi.org/10.1016/j.ijdrr.2022.103255.

Z. Yu, K. Wang, Z. Wan, S. Xie, and Z. Lv, “Popular deep learning algorithms for disease prediction: a review,” Cluster Comput, vol. 26, no. 2, pp. 1231–1251, 2023, doi: 10.1007/s10586-022-03707-y.

S. Ahmad et al., “Building resilient urban drainage systems by integrated flood risk index for evidence-based planning,” J Environ Manage, vol. 374, p. 124130, 2025, doi: https://doi.org/10.1016/j.jenvman.2025.124130.

A. Sayed, Y. Himeur, and F. Bensaali, “From time-series to 2D images for building occupancy prediction using deep transfer learning,” Eng. Appl. Artif. Intell., vol. 119, p. 105786, 2023, doi: 10.1016/j.engappai.2022.105786.

N. Van Hau et al., “Deep learning models for forecasting dengue fever based on climate data in Vietnam,” PLoS Negl Trop Dis, vol. 16, p., 2022, doi: 10.1371/journal.pntd.0010509.

S. Nahatkar, A. S. Belhe, V. V. Ganthade, P. S. Uravane, and T. Pattewar, “Collating Random Forest Classifier and Artificial Neural Networks for the Risk Detection of Maternal Health,” 2025 International Conference on Computational, Communication and Information Technology (ICCCIT), pp. 446–451, 2025, doi: 10.1109/icccit62592.2025.10927891.

Additional Files

Published

2025-12-23

How to Cite

[1]
D. . Mahdiana, M. . Ebine, and A. . Wibowo, “Random Forest and Artificial Neural Network Data Mining for Environmental and Public Health Risk Modeling in Flood-Prone Urban Areas of Indonesia”, J. Tek. Inform. (JUTIF), vol. 6, no. 6, pp. 5805–5820, Dec. 2025.

Most read articles by the same author(s)