Abstract
To fully exploit hardware resources and maximize the throughput of heterogeneous systems, this article proposes a collaborative graphics processing unit (GPU)+CPU computing method for finite-difference time-domain (FDTD) scattering simulation. To address the workload imbalance caused by the differing computational performance and device-module affinity of the processors in a heterogeneous system, a dynamic load balancing method is proposed that distributes parallel tasks according to runtime performance indices captured from the processors. Because it adjusts the workload distribution online, the method adapts to different heterogeneous hardware environments. Simulation results show that the proposed heterogeneous parallel FDTD method with dynamic load balancing introduces no additional error in solving scattering problems and is substantially more efficient than a parallel implementation that uses a single kind of processor. It achieves speedups of up to $56.76\times$ and $5.85\times$ over multicore-CPU and multi-GPU parallel execution, respectively, an attractive improvement for practical applications.
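The abstract does not specify the performance index or the redistribution rule, so the following is only a minimal sketch of one common form of dynamic load balancing, not the authors' algorithm. It assumes the FDTD grid is cut into equal slabs, that the runtime index is each device's measured throughput (slabs updated per second over the last interval), and that slabs are reassigned in proportion to that throughput. The function name `rebalance` and its parameters are hypothetical.

```python
def rebalance(total_slabs, slabs_done, seconds):
    """Split total_slabs among devices in proportion to measured throughput.

    Assumptions (not from the paper): slabs are equal-sized, and the
    performance index is slabs processed per second in the last interval.
    """
    # Observed throughput of each device (slabs per second).
    rates = [n / t for n, t in zip(slabs_done, seconds)]
    total_rate = sum(rates)

    # Ideal fractional share of the grid for each device.
    ideal = [total_slabs * r / total_rate for r in rates]

    # Round down, then give the leftover slabs to the devices with the
    # largest fractional remainders so the shares sum to total_slabs.
    shares = [int(x) for x in ideal]
    leftover = total_slabs - sum(shares)
    order = sorted(range(len(ideal)),
                   key=lambda i: ideal[i] - shares[i],
                   reverse=True)
    for i in order[:leftover]:
        shares[i] += 1
    return shares


# Hypothetical measurement: a CPU updated 8 slabs in 2.0 s (4 slabs/s),
# a GPU updated 64 slabs in 1.0 s (64 slabs/s).
new_shares = rebalance(136, [8, 64], [2.0, 1.0])
print(new_shares)  # [8, 128]
```

Calling such a routine every few time steps with fresh timings is what would let the split follow the hardware at run time; in this sketch the faster device simply receives a proportionally larger share of the grid.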