Efficient evaluation methods of elementary functions suitable for SIMD computation

Naoki Shibata

doi:10.1007/s00450-010-0108-2

Abstract

Data-parallel architectures like SIMD (Single Instruction Multiple Data) or SIMT (Single Instruction Multiple Thread) have been adopted in many recent CPU and GPU architectures. Although some SIMD and SIMT instruction sets include double-precision arithmetic and bitwise operations, there are no instructions dedicated to evaluating elementary functions like trigonometric functions in double precision. Thus, these functions have to be evaluated one by one using an FPU or using a software library. However, traditional algorithms for evaluating these elementary functions involve heavy use of conditional branches and/or table look-ups, which are not suitable for SIMD computation. In this paper, efficient methods are proposed for evaluating the sine, cosine, arc tangent, exponential and logarithmic functions in double precision without table look-ups, scattering from, or gathering into SIMD registers, or conditional branches. We implemented these methods using the Intel SSE2 instruction set to evaluate their accuracy and speed. The results showed that the average error was less than 0.67 ulp, and the maximum error was 6 ulps. The computation speed was faster than the FPUs on Intel Core 2 and Core i7 processors.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient evaluation methods of elementary functions suitable for SIMD computation

Abstract

Talk to us

Similar Papers

More From: Computer Science - Research and Development

Lead the way for us

Journal: Computer Science - Research and Development	Publication Date: Apr 20, 2010
Citations: 22

Similar Papers

Embedded GPU and multicore processors for emotional-based mobile robotic agents
Francisco Almenar ... Pedro López
Future Generation Computer Systems | VOL. 56
Francisco Almenar, et. al.Francisco Almenar ... Pedro López
12 Jun 2015
Future Generation Computer Systems | VOL. 56

Efficient SIMD optimization for media processors
Jian-Peng Zhou ... Ce Shi
Journal of Zhejiang University-SCIENCE A | VOL. 9
Jian-Peng Zhou, et. al.Jian-Peng Zhou ... Ce Shi
01 Apr 2008
Journal of Zhejiang University-SCIENCE A | VOL. 9

Accelerating Random Network Coding using 512-bit SIMD Instructions
Seo-Ran Shin ... Se-Yeon Choo
-
Seo-Ran Shin, et. al.Seo-Ran Shin ... Se-Yeon Choo
01 Oct 2019
01 Oct 2019

FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations
Kun Li ... Shigang Li
The Journal of Supercomputing | VOL. 76
Kun Li, et. al.Kun Li ... Shigang Li
29 Apr 2019
The Journal of Supercomputing | VOL. 76

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient evaluation methods of elementary functions suitable for SIMD computation

Abstract

Talk to us

Similar Papers

More From: Computer Science - Research and Development