The paper constructs an OpenStack cloud platform on Inspur servers, including components such as Keystone, Glance, Placement, Nova, Neutron, and Dashboard. It specifically investigates the method of constructing GPU pass-through using Virtual I/O (VFIO) technology. The aim is to directly allocate graphics card resources to cloud hosts to meet high-performance computing needs. VFIO technology can directly assign physical devices (such as graphics cards) to virtual machines, avoiding performance loss and operational complexity associated with traditional virtualization. After passing through the graphics card to the cloud host via VFIO technology, it can directly control and manage the graphics card resources within the cloud host, providing better support for high-performance computing tasks. Test results show that the GPU pass-through design has a memory bandwidth of 107.5 GB/s, a pixel fill rate of 41.6 GPixel/s, a texture fill rate of 53.1 GTexel/s, a base frequency of 1326.9 MHz, and possesses a FP32 (float) floating-point performance of 2.1TFLOPS, with 72 texture units and 48 ROPs raster operations units. Moreover, in high-load computing scenarios, the pass-through graphics card exhibits superior availability and performance.
Read full abstract