Implementing Proxmox VE-Based High Availability Clustering with Ceph Replication and Performance Testing for Resilient IT Infrastructure in High-Risk Disaster Areas
DOI:
https://doi.org/10.52436/1.jutif.2026.7.3.5591Keywords:
Ceph Storage, Disaster-Prone Area, Failover Mechanism, High Availability, Proxmox Virtual Environment, VirtualizationAbstract
IT infrastructure in disaster-prone areas, particularly along Java's southern coastal region within the Sunda Arc subduction zone, faces significant vulnerability to seismic events and tsunamis that cause critical system downtime, disrupting emergency coordination and exacerbating disaster impacts. This study aims to develop and validate an open-source High Availability (HA) solution using Proxmox Virtual Environment (PVE) ensuring service continuity with Recovery Time Objective (RTO) under 2 minutes and near-zero Recovery Point Objective (RPO). The methodology encompasses four systematic stages: needs analysis identifying infrastructure requirements and disaster risk assessment for Cilacap region; architecture design implementing three-node PVE cluster with Ceph distributed storage (replication factor 3) and Corosync quorum mechanism; system implementation including network bonding, VLAN segmentation, and dedicated 1Gbps Ceph replication network; and comprehensive performance testing through fault injection scenarios (power-off simulation, network partition, storage failure) measuring inter-node latency, disk I/O performance, and failover recovery metrics. Results demonstrate exceptional reliability with 99.92% availability over 72-hour monitoring, Mean Time Between Failures (MTBF) of 24.1 hours, and Mean Time To Recovery (MTTR) of 70 seconds with total downtime of 3.53 minutes across three failover simulations. Inter-node latency remains below 1ms (average 0.372-0.593ms), while disk I/O latency maintains sub-0.5ms performance during failover events. This research contributes to computer science and disaster informatics by providing a validated, replicable open-source blueprint for resilient IT infrastructure in Indonesia's disaster-prone regions, offering practical implementation pathways for integration with national emergency systems including BNPB coordination networks and BMKG early warning infrastructure.
Downloads
References
A. Isdianto, D. Kurniasari, A. Subagiyo, M. F. Haykal, and S. Supriyadi, “Pemetaan Kerentanan Tsunami untuk Mendukung Ketahanan Wilayah Pesisir,” Jurnal Permukiman, vol. 16, no. 2, p. 90, Dec. 2021, doi: 10.31815/jp.2021.16.90-100.
F. Muttaqy, A. D. Nugraha, J. Mori, N. T. Puspito, P. Supendi, and S. Rohadi, “Seismic Imaging of Lithospheric Structure Beneath Central-East Java Region, Indonesia: Relation to Recent Earthquakes,” Front Earth Sci (Lausanne), vol. 10, Jan. 2022, doi: 10.3389/feart.2022.756806.
M. Angglena and F. Pandey, “Pemetaan Zona Rawan Gempa Berdasarkan Jejak Historis Gempa dan Kerusakan Insrastruktur,” INNOVATIVE: Journal Of Social Science Research, vol. 5, no. 3, pp. 204–213, 2025.
J. Jumadi et al., “Tsunami Risk Mapping and Sustainable Mitigation Strategies for Megathrust Earthquake Scenario in Pacitan Coastal Areas, Indonesia,” Sustainability (Switzerland), vol. 17, no. 6, 2025, doi: 10.3390/su17062564.
M. Krichen, M. S. Abdalzaher, M. Elwekeil, and M. M. Fouda, “Managing natural disasters: An analysis of technological advancements, opportunities, and challenges,” Jan. 01, 2024, KeAi Communications Co. doi: 10.1016/j.iotcps.2023.09.002.
W. Jin and Y. Zhang, “Identifying Critical Success Factors of an Emergency Information Response System Based on the Similar-DEMATEL Method,” Sustainability, vol. 15, no. 20, p. 14823, Oct. 2023, doi: 10.3390/su152014823.
D. Sam and X. Liu, “The impact of unplanned system outages on critical infrastructure sectors: cybersecurity perspective,” Issues in Information Systems, vol. 24, no. 4, pp. 273–281, 2023, doi: 10.48009/4_iis_2023_121.
Nagamalleswararao Bellamkonda, “Technical Review: High availability in modern database systems,” World Journal of Advanced Engineering Technology and Sciences, vol. 16, no. 1, 2025, doi: 10.30574/wjaets.2025.16.1.0945.
Prinafsika, A. Junaidi, and M. Muharrom Al Haromainy, “Cloud-Based High Availability Architecture Using Least Connection Load Balancer and Integrated Alert System,” bit-Tech, vol. 8, no. 1, pp. 263–274, Aug. 2025, doi: 10.32877/bt.v8i1.2520.
J. K. T. - and V. N. M. -, “High Availability Database Infrastructure with Galera and HAProxy,” International Journal For Multidisciplinary Research, vol. 6, no. 6, 2024, doi: 10.36948/ijfmr.2024.v06i06.30449.
P. Srikanth and S. R. Patchamatla, “SLA-Driven Fault-Tolerant Architectures for Telecom Cloud: Achieving 99.98% Uptime,” Certified Journal | 9733 International Journal of Computer Technology and Electronics Communication, vol. 9001, 2008, doi: 10.15680/IJCTECE.2024.0706003.
Y. Ariyanto, “SINGLE SERVER-SIDE AND MULTIPLE VIRTUAL SERVER-SIDE ARCHITECTURES: PERFORMANCE ANALYSIS ON PROXMOX VE FOR E-LEARNING SYSTEMS,” Journal of Engineering and Technology for Industrial Applications, vol. 9, no. 44, 2023, doi: 10.5935/jetia.v9i44.903.
J. I. Vega-Luna, G. Salgado-Guzmán, J. F. Cosme-Aceves, V. N. Tapia-Vargas, and E. A. Andrade-González, “Linux cluster programming for high availability with an Oracle instance,” Journal of Applied Research and Technology, vol. 23, no. 2, 2025, doi: 10.22201/icat.24486736e.2025.23.2.2610.
S. Dottor Fabrizio Romano Graduand Enrico Martin, “Virtualization and Containerization a new concept for data center management to optimize resources distribution.”
E. Gamess, S. Banjara, and D. Winston, “Assessing the Capabilities of Several Raspberry Pi Models for Virtualization with Proxmox VE,” in ACMSE 2025 - Proceedings of the 2025 ACM Southeast Conference, Association for Computing Machinery, Inc, May 2025, pp. 58–66. doi: 10.1145/3696673.3723057.
S. Bhatia and C. Gabhane, “Proxmox, RedHat, and Beyond,” in Navigating VMware Turmoil in the Broadcom Era, 2025. doi: 10.1007/979-8-8688-1264-4_7.
V. P. Oleksiuk and O. R. Oleksiuk, “The practice of developing the academic cloud using the Proxmox VE platform,” Educational Technology Quarterly, vol. 2021, no. 4, 2021, doi: 10.55056/etq.36.
V. Oleksiuk, O. Oleksiuk, and O. Spirin, “Comparative Study of the Support of Academic Clouds Based on Apache CloudStack and Proxmox VE Platforms,” INSTICC, Jul. 2023, pp. 349–361. doi: 10.5220/0012064300003431.
S. ANTONIO VIEIRA, “TECHNOLOGIES FOR SERVER VIRTUALIZATION: PROXMOX AND XEN HYPERVISOR,” in Proceedings of the 18th CONTECSI International Conference on Information Systems and Technology Management, 2021. doi: 10.5748/18contecsi/pse/itm/6843.
G. Rycaj, “Comparison of virtualization performance of Proxmox, OpenVZ, OpenNebula, Vmware ESX and Xen Server,” Journal of Computer Sciences Institute, vol. 12, pp. 214–219, Sep. 2019, doi: 10.35784/jcsi.490.
L. Zhu, Z. Li, T. Tang, and X. Wang, “A High-availability Urban Rail Cloud Platform Based on OpenStack: Design, Implementation and Availability Analysis,” Tiedao Xuebao/Journal of the China Railway Society, vol. 46, no. 2, 2024, doi: 10.3969/j.issn.1001-8360.2024.02.011.
A. Malhotra, A. Elsayed, R. Torres, and S. Venkatraman, “Evaluate Solutions for Achieving High Availability or Near Zero Downtime for Cloud Native Enterprise Applications,” IEEE Access, vol. 11, pp. 85384–85394, 2023, doi: 10.1109/ACCESS.2023.3303430.
A. Dudnik, M. L. Supervisor, and M. Juutilainen, “Creating a high-availability cluster with two physical servers andvirtual machines,” University of Applied sciences, south-eastern Finlandia, 2017.
M. Goodwin, “What Is an SLA (service level agreement)? | IBM,” ibm. [Online]. Available: https://www.ibm.com/think/topics/service-level-agreement
T. Anderson, T. Roscoe, and D. Wetherall, “Preventing internet denial-of-service with capabilities,” in Computer Communication Review, 2004. doi: 10.1145/972374.972382.
S. A. Weil, S. A. Brandt, E. L. Miller, D. D. E. Long, and C. Maltzahn, “CEpH: A scalable, high-performance distributed file system,” in OSDI 2006 - 7th USENIX Symposium on Operating Systems Design and Implementation, 2006.
F. Fornari et al., “Distributed file systems performance tests on Kubernetes/Docker clusters,” in Journal of Physics: Conference Series, Institute of Physics, 2023. doi: 10.1088/1742-6596/2438/1/012030.
J. W. A. Saputra, M. Faiqurahman, and F. D. S. Sumadi, “Analisis Mekanisme Failover Controller Pada Software Defined Network,” Jurnal Repositor, vol. 3, no. 5, 2021, doi: 10.22219/repositor.v3i5.1329.
M. F. Bari et al., “Data center network virtualization: A survey,” IEEE Communications Surveys and Tutorials, vol. 15, no. 2, 2013, doi: 10.1109/SURV.2012.090512.00043.
R. Eswaran, “Reducing Hypervisor Overhead for Virutal Interrupt Delivery in KVM/ARM Platform,” 2024.
M. H. Jamil, M. S. Q. Z. Nine, and T. Kosar, “EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training,” Association for Computing Machinery (ACM), Nov. 2025, pp. 2022–2031. doi: 10.1145/3731599.3767566.
K. M. U. Ahmed, M. Alvarez, and M. H. J. Bollen, “Reliability Analysis of Internal Power Supply Architecture of Data Centers in Terms of Power Losses,” Electric Power Systems Research, vol. 193, 2021, doi: 10.1016/j.epsr.2021.107025.
Additional Files
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Muhammad Abdul Muin, Rahmawan Bagus Trianto, Muhammad Nur Faiz, Ratih Hafsarah Maharrani, Satriawan Desmana

This work is licensed under a Creative Commons Attribution 4.0 International License.





