Meta-implementation of vectorized logarithm function in binary floating-point arithmetic

Hugues De Lassus Saint-Genies, Nicolas Brunie, Guillaume Revy

doi:10.1109/asap.2018.8445102

Abstract

Besides scalar instructions, modern micro-architectures also support vector instructions, which process packed inputs (typically 4 or 8) in a single instruction. The challenge is now to write vector programs for mathematical functions such as sin, cos, exp, and log that exploit those vector instructions efficiently. This article focuses on the design of a vectorized implementation of the log(x) function, and more particularly on automating that design for different formats and micro-architectures. First, it rewrites a classic range reduction in a branchless fashion so as to make the best use of recent micro-architecture features, such as the rcp (reciprocal) instruction, and to treat all inputs in the same flow. Second, it details rigorously how to achieve “faithfully rounded” implementations. Third, it shows how to automate this implementation process using the MetaLibm framework on micro-architectures supporting SSE/AVX and AVX2. Finally, it illustrates that this process yields high-throughput implementations for the binary32 and binary64 formats in a fully automated way.
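To make the branchless range reduction mentioned in the abstract more concrete, here is a minimal scalar sketch in Python. It is not the authors' code and not MetaLibm output. It assumes the classic decomposition x = 2^e · m with m in [1, 2), a coarse reciprocal r ≈ 1/m taken from a small table (standing in for a hardware rcp result), and the identity log(x) = e·log(2) − log(r) + log(1 + u) with u = m·r − 1. The names `TABLE_BITS`, `RCP_BITS`, `range_reduce`, and `log_sketch` are hypothetical, and the sketch only handles positive normal binary64 inputs; it does not aim at faithful rounding.

```python
import math
import struct

TABLE_BITS = 7   # assumed table index width (128 entries)
RCP_BITS = 8     # assumed precision of the emulated reciprocal
LN2 = math.log(2.0)


def _build_table():
    """Precompute (r_i, -log(r_i)) for each sub-interval of [1, 2)."""
    table = []
    for i in range(1 << TABLE_BITS):
        m_mid = 1.0 + (i + 0.5) / (1 << TABLE_BITS)
        # Coarse reciprocal of the interval midpoint: a stand-in for rcp.
        r = round((1 << RCP_BITS) / m_mid) / (1 << RCP_BITS)
        table.append((r, -math.log(r)))
    return table


TABLE = _build_table()


def range_reduce(x):
    """Split a positive normal double into (e, i, u) without branching on x."""
    bits = struct.unpack("<q", struct.pack("<d", x))[0]
    e = ((bits >> 52) & 0x7FF) - 1023           # unbiased exponent
    frac = bits & ((1 << 52) - 1)               # 52 fraction bits
    # Rebuild the significand m in [1, 2) by forcing the exponent to 0.
    m = struct.unpack("<d", struct.pack("<q", frac | (1023 << 52)))[0]
    i = frac >> (52 - TABLE_BITS)               # table index from top bits
    r = TABLE[i][0]
    u = m * r - 1.0                             # small reduced argument
    return e, i, u


def log_sketch(x):
    """Approximate log(x) as e*log(2) - log(r) + log(1 + u)."""
    e, i, u = range_reduce(x)
    # Degree-5 Taylor polynomial of log(1 + u), evaluated with Horner's rule.
    p = u * (1.0 + u * (-1 / 2 + u * (1 / 3 + u * (-1 / 4 + u * (1 / 5)))))
    return e * LN2 + TABLE[i][1] + p
```

Every input follows the same sequence of operations, which is what lets such a scheme map onto vector lanes. A real implementation would typically compute u with a fused multiply-add so that the reduction step is exact, and would use a polynomial and table sized for the target accuracy rather than the values assumed here.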

Publication Date: Jul 1, 2018
Citations: 20	License type: other-oa

Similar Papers

Memory hierarchy design for Jetpipeline: to execute scalar and vector instructions in parallel
T Sasaki ... M Katahira
17 Mar 1997

Higher Radix Floating-Point Representations for FPGA-Based Arithmetic
B Catanzaro ... B Nelson
25 Jul 2014

Balancing Scalar and Vector Execution on GPU Architectures
Zhongliang Chen ... David Kaeli
01 May 2016

Basics of Floating–Point Quantization
Bernard Widrow ...
03 Jul 2008
