Parallel Vector Processor Research Articles

Double-Fourier-Series (DFS) spectral method is applied to a large-size problem of barotropic instability of double-shear flow on the sphere. The computing source is the NEC SX-5 parallel vector processors, with the maximum vector length of 512. It is demonstrated that the DFS spectral model is robust and stable even for such a large-sized intensively nonlinear problem, and can simulate well the multiple scale phenomenon without losing accuracy. In addition to the efficiency on serial computing, represented with O(N² log2 N) operations as opposed to O(N³) for the spherical harmonics spectral method, with N the truncation, the DFS spectral model also preserves the efficiency on parallel computing on vector architecture, due to its nature of two dimensional transform. The parallel performance increased slightly with the resolution, and nearly 33.5 percent (26.8 GFLOPS) of the theoretical peak performance (80 GFLOPS) was achieved in the highest-resolution experiment. The zonal-mean absolute vorticity, of which initial condition is characterized as two peaks in both hemispheres, evolves with time into a nearly constant value over the hemisphere. On the other hand, the meridional gradient of the absolute vorticity increases around the equator. The kinetic energy per unit mass is calculated for each total wavenumber, where a disturbance field of a single total wavenumber is separated by an 8th-order spherical harmonics filter. Kinetic energy spectrum shows two distinct subranges, each with a constant slope. The subrange, other than the viscous subrange, shows a slightly increasing slope with time and approaches l-3 (l is the total wavenumber) in the matured stage, when a single large vortex is formed. As the resolution increases, the subrange other than the viscous subrange extends to the higher wavenumber domain, due to low viscosity. Numerical convergence of the solution with a fixed viscosity is discussed in terms of time averaged zonal-mean statistics of the zonal-flow.

Read full abstract

EMAP3D (Electromechanical Analysis Program in Three Dimensions) is a single-processor (serial) program that uses finite element (FE) methods to solve coupled electromagnetic, thermal and structural problems for high velocity conductors in transient electromagnetic fields. Its primary application has been the simulation of electromagnetic launchers and pulsed rotating machines. While ETVAP3D has been applied successfully to a wide range of problems, its serial execution time limits the achievement of finer detail in large problems, even on high performance vector machines. The present class of production simulations, involving 100,000 unknowns, can take a week to complete on time-sharing machines at computation centers. To reduce simulation time, a parallel implementation of the EMAPSD program has been undertaken. While vector parallel programming is straightforward and a reasonable approach, access to more than 16 vector processors is limited. The availability, low cost, and scalability of massively parallel processing (MPP) make MPP computing more attractive than vector-parallel processing. The number of MPP processors on PC Beowulf clusters usually ranges from 8 to 100 and can be as high as several thousand on IBM (SP) and Gray (T3E) systems. The authors have decoupled the matrix generation and components of the FE algorithm, thereby allowing us to use any of the new MPP-parallel solvers on the matrix equations. Since the number of zero matrix elements is high for EMAPSD problems, a sparse matrix solver is ideal. Hence, their parallel implementation uses the sparse iterative solvers of PETSc (Portable Extensible Toolkit for Scientific Computing). Here, they report the performance, scalability and use of PETSc preconditioners and solver algorithms as a solution engine for real EMAP3D simulations and test cases.

Read full abstract

Parallel Vector Processor Research Articles

Related Topics

Articles published on Parallel Vector Processor

PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit

A Fast Algorithm to Estimate the Deepest Points of Lakes for Regional Lake Registration.

Operational Wind Wave Prediction System at KMA

Large scale FMO-MP2 calculations on a massively parallel-vector computer

Allostery of the two‐state model of hemoglobin studied by ECEPP energy minimization

Multibillion-atom molecular dynamics simulation: Design considerations for vector-parallel processing

Assessment of computational performance for a vector parallel implementation: 3D probabilistic model discrete cracking in concrete

Application of Double-Fourier-series Spectral Method to a Large Size Problem: Two-dimensional Simulations of the Shear Instability on the Sphere

Large scale atomistic polymer simulations using Monte Carlo methods for parallel vector processors

A parallel 3D unsteady incompressible flow solver on VPP700

Adapting EMAP3D to parallel processing [for EM launcher modelling

Multiprocessor design options and the Silicon Graphics S2MP architecture

Automatic differentiation for design sensitivity analysis of structural systems using parallel-vector processors

Difficulties in Vector-Parallel Processing of Monte Carlo Codes

NON-LINEAR STRUCTURAL RESPONSE USING ADAPTIVE DYNAMIC RELAXATION ON A MASSIVELY PARALLEL-PROCESSING SYSTEM

NON‐LINEAR STRUCTURAL RESPONSE USING ADAPTIVE DYNAMIC RELAXATION ON A MASSIVELY PARALLEL‐PROCESSING SYSTEM

Weather forecasting on parallel architectures

Adaptive dynamic relaxation algorithm for non-linear hyperelastic structures Part II. Single-processor implementation

Parallel processing in Japan: national and corporate trends

Implementation of a synthetic aperture processing scheme in a towed array sonar system

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Parallel Vector Processor Research Articles

Related Topics

Articles published on Parallel Vector Processor

PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit

A Fast Algorithm to Estimate the Deepest Points of Lakes for Regional Lake Registration.

Operational Wind Wave Prediction System at KMA

Large scale FMO-MP2 calculations on a massively parallel-vector computer

Allostery of the two‐state model of hemoglobin studied by ECEPP energy minimization

Multibillion-atom molecular dynamics simulation: Design considerations for vector-parallel processing

Assessment of computational performance for a vector parallel implementation: 3D probabilistic model discrete cracking in concrete

Application of Double-Fourier-series Spectral Method to a Large Size Problem: Two-dimensional Simulations of the Shear Instability on the Sphere

Large scale atomistic polymer simulations using Monte Carlo methods for parallel vector processors

A parallel 3D unsteady incompressible flow solver on VPP700

Adapting EMAP3D to parallel processing [for EM launcher modelling

Multiprocessor design options and the Silicon Graphics S2MP architecture

Automatic differentiation for design sensitivity analysis of structural systems using parallel-vector processors

Difficulties in Vector-Parallel Processing of Monte Carlo Codes

NON-LINEAR STRUCTURAL RESPONSE USING ADAPTIVE DYNAMIC RELAXATION ON A MASSIVELY PARALLEL-PROCESSING SYSTEM

NON‐LINEAR STRUCTURAL RESPONSE USING ADAPTIVE DYNAMIC RELAXATION ON A MASSIVELY PARALLEL‐PROCESSING SYSTEM

Weather forecasting on parallel architectures

Adaptive dynamic relaxation algorithm for non-linear hyperelastic structures Part II. Single-processor implementation

Parallel processing in Japan: national and corporate trends

Implementation of a synthetic aperture processing scheme in a towed array sonar system