Shared Memory Parallel Machines Research Articles

Distributed-memory parallel processors (DMPPs) can deliver peak performance higher than vector supercomputers while promising a better cost-performance ratio. Programming, however, is harder than on traditional vector systems, especially when problems necessitating unstructured solution methods are considered. A class of such applications, with large resource requirements, is the numerical solution of partial differential equations (PDEs) on nonuniformly refined three-dimensional finite element discretizations. Porting an application of this class from vector and shared-memory parallel machines to DMPPs involves some fundamental algorithm changes, such as grid decomposition, mapping, and coloring strategies. In addition, no standardized language interface is available to ease the efficient parallelization and porting among DMPPs and between vector computers and DMPPs. This article describes how PILS-an existing package for the iterative solution of large unstructured sparse linear systems of equations on vector computers-was ported to DMPPs, using the parallelizing Fortran compiler Oxygen. Two DMPPs, namely an Intel Paragon and a Fujitsu AP1000, were used to evaluate the performance of the generated parallel program quantitatively. The results indicate how an application should be designed to be portable among supercomputers of different architecture. Several language and architecture features are essential for such a porting process and ease the parallelization of similar applications drastically.

Read full abstract

view Abstract Citations (37) References (13) Co-Reads Similar Papers Volume Content Graphics Metrics Export Citation NASA/ADS A Linear Moving Adaptive Particle-Mesh N-Body Algorithm Pen, Ue-Li Abstract We present the theory, algorithm, and numerical experiments for the implementation of a new N-body algorithm, called APM. It scales linearly with the number of particles for the computational effort per time step. The resolution is fully adaptive, with a typical smoothing length comparable to the local interparticle separation. This is accomplished through the use of a dynamical coordinate system, which adjusts itself to the local density distribution. For the Poisson solver a multigrid iteration scheme is used. The scheme is fully data parallel and data local. A specific implementation of this algorithm is described which is available to the astrophysics community. It is optimized for scalar, vector, and shared memory parallel machines. Performance characteristics are compared to traditional particle-mesh (PM) algorithms. The algorithm inherently has a threefold higher computing cost than the PM method with the same number of grid cells, but offers much higher resolution, and currently appears to be amongst the fastest adaptive high-resolution N-body algorithms. Publication: The Astrophysical Journal Supplement Series Pub Date: September 1995 DOI: 10.1086/192219 Bibcode: 1995ApJS..100..269P Keywords: METHODS: NUMERICAL full text sources ADS |

Read full abstract

Shared Memory Parallel Machines Research Articles

Related Topics

Articles published on Shared Memory Parallel Machines

Parallel Reservoir Simulator Computations

Matrix partitioning on a virtual shared memory parallel machine

Efficient parallel solution of structural eigenvalue problems

Migration of Vectorized Iterative Solvers to Distributed-Memory Architectures

A Linear Moving Adaptive Particle-Mesh N-Body Algorithm

Parallel Computation of Gröbner Bases on Distributed Memory Machines

A Framework for Distributed VLSI Simulation on a Network of Workstations

Optimal algorithms for the many-to-one routing problem on two-dimensional meshes

Fast parallel implementation of lazy languages—the EQUALS experience

Parallel techniques for computational geometry

Li + HCl RIOSA cross section calculations on parallel computers

A portable environment for developing parallel FORTRAN programs

The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel Computer

The NYU Ultracomputer—designing a MIMD, shared-memory parallel machine (Extended Abstract)

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Shared Memory Parallel Machines Research Articles

Related Topics

Articles published on Shared Memory Parallel Machines

Parallel Reservoir Simulator Computations

Matrix partitioning on a virtual shared memory parallel machine

Efficient parallel solution of structural eigenvalue problems

Migration of Vectorized Iterative Solvers to Distributed-Memory Architectures

A Linear Moving Adaptive Particle-Mesh N-Body Algorithm

Parallel Computation of Gröbner Bases on Distributed Memory Machines

A Framework for Distributed VLSI Simulation on a Network of Workstations

Optimal algorithms for the many-to-one routing problem on two-dimensional meshes

Fast parallel implementation of lazy languages—the EQUALS experience

Parallel techniques for computational geometry

Li + HCl RIOSA cross section calculations on parallel computers

A portable environment for developing parallel FORTRAN programs

The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel Computer

The NYU Ultracomputer—designing a MIMD, shared-memory parallel machine (Extended Abstract)