An implementation of loop fusion for improving performance and energy consumption of shared-memory parallel codes

Iulia Stirb

doi:10.1109/iccp.2017.8117057

Abstract

State-of-the-art Low Level Virtual Machine (LLVM) compiler infrastructure has a dedicated set of optimizations for loops. Each optimization is organized as a separate pass in LLVM, whereas passes are created using a mix of object creational patterns. However, recent focus of modern compilers is in improving runtime performance using a large set of conservative optimizations, most often omitting the energy consumption impact. This paper introduces the implementation of a new loop fusion algorithm designed for LLVM, which aims to improve both runtime performance and energy consumption of parallel codes involving loop parallelism. The algorithm proposed merges two non dependent loops with the same number of iterations and without any code between. Two loops are dependent when the first loop has to finish for the second to start, whereas two independent loops may not be allowed to be executed in parallel. This paper also proves that loop fusion is useful in optimizing loop parallelism, since the fusion of two loops cuts in half the number of threads that would otherwise be required to execute each iteration (when there is a one-to-one relation between threads and iterations). The decreased number of threads reduces the parallelization overhead which in turn improves the energy consumption. The improvements are discussed in the context of Non-Uniform Memory Access (NUMA) systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An implementation of loop fusion for improving performance and energy consumption of shared-memory parallel codes

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Improving performance and energy consumption with loop fusion optimization and parallelization
Iulia Stirb ... Horia Ciocarlie
-
Iulia Stirb, et. al.Iulia Stirb ... Horia Ciocarlie
01 Nov 2016
01 Nov 2016

A Low-Level Virtual Machine Just-In-Time Prototype for Running an Energy-Saving Hardware-Aware Mapping Algorithm on C/C++ Applications That Use Pthreads
Iulia Știrb ... Gilbert-Rainer Gillich
Energies | VOL. 16
Iulia Știrb, et. al.Iulia Știrb ... Gilbert-Rainer Gillich
23 Sep 2023
Energies | VOL. 16

LLVMVF: A Generic Approach for Verification of Multicore Software
Marcelo Sousa ... Alper Sen
Journal of Electronic Testing | VOL. 29
Marcelo Sousa, et. al.Marcelo Sousa ... Alper Sen
07 Sep 2013
Journal of Electronic Testing | VOL. 29

A Translation Framework for Automatic Translation of Annotated LLVM IR into OpenCL Kernel Function
Chen-Ting Chang ... I-Wei Wu
-
Chen-Ting Chang, et. al.Chen-Ting Chang ... I-Wei Wu
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An implementation of loop fusion for improving performance and energy consumption of shared-memory parallel codes

Abstract

Talk to us

Similar Papers