Connection Machine System Research Articles

The fluid flow in a three-dimensional twisted channel is modeled by both the compressible Navier-Stokes equations, and the Euler equations. A three stage Runge-Kutta method is used for integrating the system of equations in time. A second-order accurate, centered difference scheme is used for spatial derivatives of the flux variables. For both the Euler and the Navier-Stokes equations artificial viscosity introduced through fourth-order centered differences is used to stabilize the numeric scheme. By using lower order difference approximations on or close to the boundary than in the interior, the difference stencils can be evaluated at all grid points concurrently. A few different difference molecules for the boundaries, and different factorizations of the fourth-order difference operators were evaluated. With the appropriate factorization of the difference stencils, six variables per lattice point suffice for the evaluation of the difference stencils occurring in the code. The three fourth-order stencils we investigated, including three different factorizations of one of these stencils, account for three out these six variables. The convergence rate for all stencils and their factorizations is approximately the same for the first 1000–1500 steps at which point the residual has reached a value of 10 −2–10 −3. From this point on the convergence rate for one of the factorizations of the fourth-order stencil is approximately twice that of one of the unfactored stencils. A performance of 1.05 Gflops/s was demonstrated on 65 536 processor Connection Machine system with 512 Mbytes of primary storage. The performance scales in proportion to the number of processors. The performance on 8k processor configurations was 135 Mflops/s, on 16k processors 265 Mflops/s and 525 Mflops/s on 32k processors. The efficiency is independent of the machine size. The evaluation of the boundary conditions accounted for less than 5% of the total time. A performance improvement by a factor of about three is expected with optimized implementations of functional kernels such as convolution, and matrix-vector multiplication.

Explicit methods for the solution of fluid flow problems are of considerable interest in supercomputing. These methods parallelize well. The treatment of the boundaries is of particular interest with respect to both the numeric behavior of the solution and the computational efficiency. We have solved the three-dimensional Euler equations for a twisted channel using second-order, centered difference operators, and a three-stage Runge-Kutta method for the integration. Three different fourth-order dissipation operators were studied for numeric stabilization: one positive definite, one positive semidefinite, and one indefinite. The operators only differ in the treatment of the boundary. For computational efficiency all dissipation operators were designed with a constant bandwidth in matrix representation, with the bandwidth determined by the operator in the interior. The positive definite dissipation operator results in a significant growth in entropy close to the channel walls. The other operators maintain constant entropy. Several different implementations of the semidefinite operator obtained through factoring of the operator were also studied. We show the difference both in convergence rate and robustness for the different dissipation operators, and the factorizations of the operator due to Eriksson. For the simulations in this study one of the factorizations of the semidefinite operator required 70%–90% of the number of iterations required by the positive definite operator. The indefinite operator was sensitive to perturbations in the inflow boundary conditions. The simulations were performed on a 8,192 processor Connection Machine system CM-2. Full processor utilization was achieved, and a performance of 135 Mflops/sec in single precision was obtained. A performance of 1.1 Gflops/sec for a fully configured system with 65,536 processors was demonstrated.

Connection Machine System Research Articles

Related Topics

Articles published on Connection Machine System

The Parallel Multipole Method on the Connection Machine

Massively parallel switch-level simulation: a feasibility study

A dataparallel implementation of an explicit method for the three-dimensional compressible Navier-Stokes equations

Data structures and algorithms for the finite element method on a data parallel supercomputer

Histogram computation on distributed memory architectures

Boundary modifications of the dissipation operators for the three-dimensional Euler equations

THE FINITE ELEMENT METHOD ON A DATA PARALLEL COMPUTING SYSTEM

Scans as primitive parallel operations

Two-&-Two, a high level system for retrieving pairs of documents

Special issue on parallelism

Parallel free-text search on the connection machine system

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Connection Machine System Research Articles

Related Topics

Articles published on Connection Machine System

The Parallel Multipole Method on the Connection Machine

Massively parallel switch-level simulation: a feasibility study

A dataparallel implementation of an explicit method for the three-dimensional compressible Navier-Stokes equations

Data structures and algorithms for the finite element method on a data parallel supercomputer

Histogram computation on distributed memory architectures

Boundary modifications of the dissipation operators for the three-dimensional Euler equations

THE FINITE ELEMENT METHOD ON A DATA PARALLEL COMPUTING SYSTEM

Scans as primitive parallel operations

Two-&amp;-Two, a high level system for retrieving pairs of documents

Special issue on parallelism

Parallel free-text search on the connection machine system

Two-&-Two, a high level system for retrieving pairs of documents