Mixed Precision Iterative Refinement Research Articles

AbstractLow precision arithmetic, in particular half precision (16‐bit) floating point arithmetic, is now available in commercial hardware. Using lower precision can offer significant savings in computation and communication costs with proportional savings in energy. Motivated by this, there has been a renewed interest in mixed precision iterative refinement schemes for solving linear systems , and new variants of GMRES‐based iterative refinement have been developed. Each particular variant with a given combination of precisions leads to different condition number‐based constraints for convergence of the backward and forward errors, and each has different performance costs. The constraints for convergence given in the literature are, as an artifact of the analyses, often overly strict in practice, and thus could lead a user to select a more expensive variant when a less expensive one would have sufficed. In this work, we develop a multistage mixed precision iterative refinement solver which aims to combine existing mixed precision approaches to balance performance and accuracy and improve usability. For a user‐specified initial combination of precisions, the algorithm begins with the least expensive approach and convergence is monitored via inexpensive computations with quantities produced during the iteration. If slow convergence or divergence is detected using particular stopping criteria, the algorithm switches to use a more expensive, but more reliable variant. A novel aspect of our approach is that, unlike existing implementations, our algorithm first attempts to use “stronger” GMRES‐based solvers for the solution update before resorting to increasing the precision(s). In some scenarios, this can avoid the need to refactorize the matrix in higher precision. We perform extensive numerical experiments on a variety of random dense problems and problems from real applications which confirm the benefits of the multistage approach.

Read full abstract

In this survey paper, we compare native double precision solvers with emulated- and mixed-precision solvers of linear systems of equations as they typically arise in finite element discretisations. The emulation utilises two single float numbers to achieve higher precision, while the mixed precision iterative refinement computes residuals and updates the solution vector in double precision but solves the residual systems in single precision. Both techniques have been known since the 1960s, but little attention has been devoted to their performance aspects. Motivated by changing paradigms in processor technology and the emergence of highly-parallel devices with outstanding single float performance, we adapt the emulation and mixed precision techniques to coupled hardware configurations, where the parallel devices serve as scientific co-processors. The performance advantages are examined with respect to speedups over a native double precision implementation (time aspect) and reduced area requirements for a chip (space aspect). The paper begins with an overview of the theoretical background, algorithmic approaches and suitable hardware architectures. We then employ several conjugate gradient (CG) and multigrid solvers and study their behaviour for different parameter settings of the iterative refinement technique. Concrete speedup factors are evaluated on the coupled hardware configuration of a general-purpose CPU and a graphics processor. The dual performance aspect of potential area savings is assessed on a field programmable gate array (FPGA). In the last part, we test the applicability of the proposed mixed precision schemes with ill-conditioned matrices. We conclude that the mixed precision approach works very well with the parallel co-processors gaining speedup factors of four to five, and area savings of three to four, while maintaining the same accuracy as a reference solver executing everything in double precision.

Read full abstract

Mixed Precision Iterative Refinement Research Articles

Articles published on Mixed Precision Iterative Refinement

Efficient Mixed-Precision Matrix Factorization of the Inverse Overlap Matrix in Electronic Structure Calculations with AI-Hardware and GPUs.

Acceleration of iterative refinement for singular value decomposition

Mixed Precision Iterative Refinement with Sparse Approximate Inverse Preconditioning

Combining Sparse Approximate Factorizations with Mixed-precision Iterative Refinement

Multistage mixed precision iterative refinement

Evaluating Performance of Mixed Precision Linear Solvers with Iterative Refinement

Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems.

AIR: Iterative refinement acceleration using arbitrary dynamic precision

Energy-Efficient Iterative Refinement Using Dynamic Precision

Parallel Transient Stability-Constrained Optimal Power Flow Using GPU as Coprocessor

倍精度と多倍長精度浮動小数点数を用いた反復改良法による連立一次方程式の高精度高速解法について(行列・固有値問題の解法とその応用, 平成21年研究部会連合発表会)

A fast, hybrid, power-efficient high-precision solver for large linear systems based on low-precision hardware

Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors

Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms

High-Performance Mixed-Precision Linear Solver for FPGAs

Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems

Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations

Iterative refinement for constrained and weighted linear least squares

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Mixed Precision Iterative Refinement Research Articles

Articles published on Mixed Precision Iterative Refinement

Efficient Mixed-Precision Matrix Factorization of the Inverse Overlap Matrix in Electronic Structure Calculations with AI-Hardware and GPUs.

Acceleration of iterative refinement for singular value decomposition

Mixed Precision Iterative Refinement with Sparse Approximate Inverse Preconditioning

Combining Sparse Approximate Factorizations with Mixed-precision Iterative Refinement

Multistage mixed precision iterative refinement

Evaluating Performance of Mixed Precision Linear Solvers with Iterative Refinement

Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems.

AIR: Iterative refinement acceleration using arbitrary dynamic precision

Energy-Efficient Iterative Refinement Using Dynamic Precision

Parallel Transient Stability-Constrained Optimal Power Flow Using GPU as Coprocessor

倍精度と多倍長精度浮動小数点数を用いた反復改良法による連立一次方程式の高精度高速解法について(行列・固有値問題の解法とその応用, 平成21年研究部会連合発表会)

A fast, hybrid, power-efficient high-precision solver for large linear systems based on low-precision hardware

Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors

Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms

High-Performance Mixed-Precision Linear Solver for FPGAs

Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems

Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations

Iterative refinement for constrained and weighted linear least squares