Abstract

Reproducibility for High Performance Computing (HPC) systems has been discussed for some time already, but more work should be carried out to cover the latest accelerators that equip the fastest supercomputers such as the ones listed in Top500. In this paper, we perform a replication of a performance evaluation carried out using an N-Body Open MP parallel application on a XeonPhi accelerator. We also compare the obtained performance with a similar N-Body CUDA application. Besides encountering intriguing results about the Xeon Phi on the number of hardware threads, our comparison against Nvidia boards using the same load shows that the execution Xeon Phi is slower than on Nvidia K20 and GTX760 accelerators.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.