Abstract

Most of the data generated on high-performance computing systems are transferred to storage in remote systems for various purposes such as backup. To detect data corruption caused by network or storage failures during data transfer, the receiver system verifies data integrity by comparing the checksum of the data. However, the internal operation of the storage device is not sufficiently investigated in the existing end-to-end integrity verification techniques. In this paper, we propose an efficient and robust data integrity verification scheme for large-scale data transfer between computing systems with high-performance storage devices. To ensure the robustness of the integrity verification, we control the order of I/O operations. In addition, we parallelize checksum computing and overlap it with I/O operations to make the integrity verification efficient.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call