Abstract
In scientic computing, systems often manage computations that require continuous acquisition of of satellite data and the management of large databases, as well as the execution of analysis software and simulation models (e.g. Monte Carlo or molecular dynamics cell simulations) which may require several weeks of continuous run. These systems, consequently, should ensure the continuity of operation even in case of serious faults. HAVmS (High Availability Virtual machine System) is a highly available, "fault tolerant" system with zero downtime in case of fault. It is based on the use of Virtual Machines and implemented by two servers with similar characteristics. HAVmS, thanks to the developed software solutions, is unique in its kind since it automatically failbacks once faults have been fixed. The system has been designed to be used both with professional or inexpensive hardware and supports the simultaneous execution of multiple services such as: web, mail, computing and administrative services, uninterrupted computing, data base management. Finally the system is cost effective adopting exclusively open source solutions, is easily manageable and for general use.
Highlights
HAVmS effectively solves the problems related to the ”robustness” of computer systems wholly embracing the concept of high availability
All the above requirements have been met by an accurate choice of the available Open Source solutions meeting our targets and their integration with the Sw conceived by the authors
HAVmS is a Highly Available Fault Tolerant general purpose, recyclable system based on the use of VMs, assuring continuity of operation and no interruption in services providing in case of fault
Summary
HAVmS effectively solves the problems related to the ”robustness” of computer systems wholly embracing the concept of high availability. A system in high availability, HA in what follows, must ensure the continuity over time of the provided services, which, in case of fault, must be restored in the shortest possible time. HAVmS, through an accurate design of the Hw and Sw, significantly reduces the faults and their negative effects on the provide services. HAVmS has been designed mainly keeping in mind the following requirements: cost effectiveness, ease of management and, above all, the ability to automatically implement restoring strategies of the provided services without any interruption. The fault tolerance of the system is ensured by a continuous synchronization of the two servers. This synchronization keep the data and the states of all the virtual machines, VMs in what follows, perfectly aligned. Once the fault is fixed, it will be enough to reconnect the repaired server and automatically the failback procedure restores the proper functioning of the system including its HA capabilities
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.