Abstract

We optimized the Moving Particle Simulation (MPS) method for the Kepler GPU architecture. Solving the sparse matrix equation occupies a large portion of the run time of a particle simulation, which makes it a major performance bottleneck. At NVIDIA's GPU Technology Conference 2012, we reported accelerating the MPS method by about 7x on an NVIDIA Tesla C2070 compared with an Intel Core i7 920 (4 cores). Last year, NVIDIA released its latest GPU architecture, Kepler. We optimized and accelerated the particle simulation on Kepler GPUs by using the new shuffle instruction and the read-only data cache. As a result, the sparse matrix-vector multiplication operation ran 1.48x faster on Kepler (Tesla K20c) than on Fermi (Tesla C2075).
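
For illustration only, the following is a minimal sketch of how a sparse matrix-vector product can use both Kepler features named above. It is not the authors' kernel. It assumes a CSR storage format and a one-warp-per-row mapping, neither of which is stated in the abstract, and the names `spmv_csr_warp`, `row_ptr`, `col_idx`, and `vals` are hypothetical. The read-only data cache is reached through `__ldg`, and the per-row partial sums are combined with a warp shuffle. Current CUDA toolkits spell the shuffle `__shfl_down_sync`; Kepler-era toolkits used `__shfl_down` without the mask argument.

```cuda
#include <cstdio>
#include <cmath>
#include <cuda_runtime.h>

// Hypothetical CSR SpMV kernel: one warp computes one row of y = A * x.
// Assumes blockDim.x is a multiple of 32 so that each warp maps to a single row.
__global__ void spmv_csr_warp(int num_rows,
                              const int* __restrict__ row_ptr,
                              const int* __restrict__ col_idx,
                              const double* __restrict__ vals,
                              const double* __restrict__ x,
                              double* __restrict__ y)
{
    const int lane = threadIdx.x & 31;                              // lane within the warp
    const int row  = (blockIdx.x * blockDim.x + threadIdx.x) / 32;  // one row per warp
    if (row >= num_rows) return;  // row is warp-uniform, so a whole warp exits together

    // Each lane accumulates a strided share of the row's nonzeros.
    // __ldg routes the irregular reads of x through the read-only data cache.
    double sum = 0.0;
    for (int j = row_ptr[row] + lane; j < row_ptr[row + 1]; j += 32)
        sum += vals[j] * __ldg(&x[col_idx[j]]);

    // Warp-level tree reduction with shuffle; no shared memory is needed.
    for (int offset = 16; offset > 0; offset >>= 1)
        sum += __shfl_down_sync(0xffffffffu, sum, offset);

    if (lane == 0) y[row] = sum;  // lane 0 holds the full row sum
}

int main()
{
    // 3x3 test matrix [[2,0,1],[0,3,0],[4,0,5]] in CSR form, x = (1,2,3).
    const int n = 3, nnz = 5;
    const int    h_row_ptr[n + 1] = {0, 2, 3, 5};
    const int    h_col_idx[nnz]   = {0, 2, 1, 0, 2};
    const double h_vals[nnz]      = {2, 1, 3, 4, 5};
    const double h_x[n]           = {1, 2, 3};
    const double expected[n]      = {5, 6, 19};  // A * x computed by hand
    double h_y[n] = {0, 0, 0};

    int *d_row_ptr, *d_col_idx;
    double *d_vals, *d_x, *d_y;
    cudaMalloc(&d_row_ptr, sizeof(h_row_ptr));
    cudaMalloc(&d_col_idx, sizeof(h_col_idx));
    cudaMalloc(&d_vals, sizeof(h_vals));
    cudaMalloc(&d_x, sizeof(h_x));
    cudaMalloc(&d_y, sizeof(h_y));
    cudaMemcpy(d_row_ptr, h_row_ptr, sizeof(h_row_ptr), cudaMemcpyHostToDevice);
    cudaMemcpy(d_col_idx, h_col_idx, sizeof(h_col_idx), cudaMemcpyHostToDevice);
    cudaMemcpy(d_vals, h_vals, sizeof(h_vals), cudaMemcpyHostToDevice);
    cudaMemcpy(d_x, h_x, sizeof(h_x), cudaMemcpyHostToDevice);

    const int threads = 128;                                // 4 warps per block
    const int blocks  = (n * 32 + threads - 1) / threads;   // 32 threads per row
    spmv_csr_warp<<<blocks, threads>>>(n, d_row_ptr, d_col_idx, d_vals, d_x, d_y);
    cudaError_t err = cudaMemcpy(h_y, d_y, sizeof(h_y), cudaMemcpyDeviceToHost);
    if (err != cudaSuccess) { printf("CUDA error: %s\n", cudaGetErrorString(err)); return 1; }

    int failures = 0;
    for (int i = 0; i < n; ++i) {
        printf("y[%d] = %g\n", i, h_y[i]);
        if (std::fabs(h_y[i] - expected[i]) > 1e-12) ++failures;
    }
    cudaFree(d_row_ptr); cudaFree(d_col_idx); cudaFree(d_vals); cudaFree(d_x); cudaFree(d_y);
    return failures;  // 0 when the GPU result matches the hand-computed product
}
```

The sketch needs a device of compute capability 3.5 or higher for `__ldg` (the Tesla K20c qualifies) and can be built with, for example, `nvcc -arch=sm_35` on a toolkit that still supports that target. The one-warp-per-row mapping suits rows with many nonzeros; for very short rows, a smaller group of lanes per row may waste fewer threads.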
