Cache memory behavior of advanced PDE solvers

D Wallin,H Johansson,S Holmgren

doi:10.1016/s0927-5452(04)80061-3

Abstract

This chapter discusses three different partial differential equation (PDE) solver kernels in respect to cache memory performance on a simulated shared memory computer. The kernels implement state-of-the-art solution algorithms for complex application problems and the simulations are performed for data sets of realistic size. The performance of the studied applications benefits from much longer cache lines than normally found in commercially available computer systems. The reason for this is that, numerical algorithms are carefully coded and have regular memory access patterns. These programs take advantage of spatial locality and the amount of false sharing is limited. A simple sequential hardware prefetch strategy, providing cache behavior similar to a large cache line, could potentially yield large performance gains for these applications. Unfortunately, such prefetchers often lead to additional address snoops in multiprocessor caches. However, applying a bundle technique that lumps several read address transactions together, this large increase in address snoops can be avoided. For all studied algorithms, both the address snoops and cache misses are largely reduced in the bundled prefetch protocol.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Cache memory behavior of advanced PDE solvers

Abstract

Talk to us

Similar Papers

More From: Advances in Parallel Computing

Lead the way for us

Journal: Advances in Parallel Computing	Publication Date: Jan 1, 2004
Citations: 39

Similar Papers

Analyzing Advanced PDE Solvers Through Simulation
Henrik Johansson ... Sverker Holmgren
-
Henrik Johansson, et. al.Henrik Johansson ... Sverker Holmgren
01 Jan 2006
01 Jan 2006

DiscretizationNet: A machine-learning based solver for Navier–Stokes equations using finite volume discretization
Rishikesh Ranade ... Jay Pathak
Computer Methods in Applied Mechanics and Engineering | VOL. 378
Rishikesh Ranade, et. al.Rishikesh Ranade ... Jay Pathak
26 Feb 2021
Computer Methods in Applied Mechanics and Engineering | VOL. 378

Dot-World: The Easy 2D and 3D Simulator to Enhance Education in Developing Countries
Derek Fong ... Charlotte Lew
-
Derek Fong, et. al.Derek Fong ... Charlotte Lew
02 Dec 2022
02 Dec 2022

Spatially Dispersionless, Unconditionally Stable FC–AD Solvers for Variable-Coefficient PDEs
O P Bruno ... A Prieto
Journal of Scientific Computing | VOL. 58
O P Bruno, et. al.O P Bruno ... A Prieto
05 Jun 2013
Journal of Scientific Computing | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cache memory behavior of advanced PDE solvers

Abstract

Talk to us

Similar Papers

More From: Advances in Parallel Computing