A massively parallel and scalable multi-GPU material point method

Xinlei Wang,Yixin Zhu,Min Tang,Song-Chun Zhu,Stuart R Slattery,Chenfanfu Jiang,Dinesh Manocha,Minchen Li,Yuxing Qiu,Yu Fang

doi:10.1145/3386569.3392442

Abstract

Harnessing the power of modern multi-GPU architectures, we present a massively parallel simulation system based on the Material Point Method (MPM) for simulating physical behaviors of materials undergoing complex topological changes, self-collision, and large deformations. Our system makes three critical contributions. First, we introduce a new particle data structure that promotes coalesced memory access patterns on the GPU and eliminates the need for complex atomic operations on the memory hierarchy when writing particle data to the grid. Second, we propose a kernel fusion approach using a new Grid-to-Particles-to-Grid ( G2P2G ) scheme, which efficiently reduces GPU kernel launches, improves latency, and significantly reduces the amount of global memory needed to store particle data. Finally, we introduce optimized algorithmic designs that allow for efficient sparse grids in a shared memory context, enabling us to best utilize modern multi-GPU computational platforms for hybrid Lagrangian-Eulerian computational patterns. We demonstrate the effectiveness of our method with extensive benchmarks, evaluations, and dynamic simulations with elastoplasticity, granular media, and fluid dynamics. In comparisons against an open-source and heavily optimized CPU-based MPM codebase [Fang et al. 2019] on an elastic sphere colliding scene with particle counts ranging from 5 to 40 million, our GPU MPM achieves over 100x per-time-step speedup on a workstation with an Intel 8086K CPU and a single Quadro P6000 GPU, exposing exciting possibilities for future MPM simulations in computer graphics and computational science. Moreover, compared to the state-of-the-art GPU MPM method [Hu et al. 2019a], we not only achieve 2x acceleration on a single GPU but our kernel fusion strategy and Array-of-Structs-of-Array ( AoSoA ) data structure design also generalizes to multi-GPU systems. Our multi-GPU MPM exhibits near-perfect weak and strong scaling with 4 GPUs, enabling performant and large-scale simulations on a 1024 3 grid with close to 100 million particles with less than 4 minutes per frame on a single 4-GPU workstation and 134 million particles with less than 1 minute per frame on an 8-GPU workstation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A massively parallel and scalable multi-GPU material point method

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Graphics

Lead the way for us

Journal: ACM Transactions on Graphics	Publication Date: Aug 12, 2020
Citations: 42

Similar Papers

Large deformation and brittle failure calculated using the dual-domain material point method
Paul L Barclay ... Duan Z Zhang
Computational Particle Mechanics | VOL. 11
Paul L Barclay, et. al.Paul L Barclay ... Duan Z Zhang
17 Jun 2023
Computational Particle Mechanics | VOL. 11

Run-out of the 2015 Shenzhen landslide using the material point method with the softening model
Butao Shi ... Yun Zhang
Bulletin of Engineering Geology and the Environment | VOL. 78
Butao Shi, et. al.Butao Shi ... Yun Zhang
09 Oct 2017
Run-out of the 2015 Shenzhen landslide using the material point method with the softening model
Butao Shi ... Yun Zhang

Material point method after 25 years: Theory, implementation, and applications
Alban De Vaucorbeil ... Sina Sinaie
-
Alban De Vaucorbeil, et. al.Alban De Vaucorbeil ... Sina Sinaie
01 Jan 2020
01 Jan 2020

IGIMP: An implicit generalised interpolation material point method for large deformations
T.J Charlton ... C.E Augarde
Computers & Structures | VOL. 190
T.J Charlton, et. al.T.J Charlton ... C.E Augarde
31 May 2017
Computers & Structures | VOL. 190

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A massively parallel and scalable multi-GPU material point method

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Graphics