Ocelot

Gregory Frederick Diamos,Sudhakar Yalamanchili,Andrew Robert Kerr,Nathan Clark

doi:10.1145/1854273.1854318

Abstract

Ocelot is a dynamic compilation framework designed to map the explicitly data parallel execution model used by NVIDIA CUDA applications onto diverse multithreaded platforms. Ocelot includes a dynamic binary translator from Parallel Thread eXecution ISA (PTX) to many-core processors that leverages the Low Level Virtual Machine (LLVM) code generator to target x86 and other ISAs. The dynamic compiler is able to execute existing CUDA binaries without recompilation from source and supports switching between execution on an NVIDIA GPU and a many-core CPU at runtime. It has been validated against over 130 applications taken from the CUDA SDK, the UIUC Parboil benchmarks [1], the Virginia Rodinia benchmarks [2], the GPU-VSIPL signal and image processing library [3], the Thrust library [4], and several domain specific applications. This paper presents a high level overview of the implementation of the Ocelot dynamic compiler highlighting design decisions and trade-offs, and showcasing their effect on application performance. Several novel code transformations are explored that are applicable only when compiling explicitly parallel applications and traditional dynamic compiler optimizations are revisited for this new class of applications. This study is expected to inform the design of compilation tools for explicitly parallel programming models (such as OpenCL) as well as future CPU and GPU architectures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ocelot

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Static Dalvik Bytecode Optimization for Android Applications
Jeehong Kim ... Young Ik Eom
ETRI Journal | VOL. 37
Jeehong Kim, et. al.Jeehong Kim ... Young Ik Eom
01 Oct 2015
ETRI Journal | VOL. 37

Intel's Array Building Blocks: A retargetable, dynamic compiler and embedded language
...
-
, et. al. ...
02 Apr 2011
02 Apr 2011

Intel's Array Building Blocks: A retargetable, dynamic compiler and embedded language
Chris J Newburn ... Anwar Ghuloum
-
Chris J Newburn, et. al.Chris J Newburn ... Anwar Ghuloum
01 Apr 2011
01 Apr 2011

Enhancing R with Advanced Compilation Tools and Methods
Duncan Temple Lang
Statistical Science | VOL. 29
Duncan Temple LangDuncan Temple Lang
01 May 2014
Statistical Science | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ocelot

Abstract

Talk to us

Similar Papers