Data-flow analysis and optimization for data coherence in heterogeneous architectures

Rafael Sousa, Marcio Pereira, Fernando Magno Quintão Pereira, Guido Araujo

doi:10.1016/j.jpdc.2019.04.004

Abstract

Although heterogeneous computing has enabled developers to achieve impressive program speed-ups, the cost of moving data and keeping it coherent between host and device can easily eliminate any performance gains achieved by acceleration. To deal with this problem, this paper introduces DCA: a pair of data-flow analyses that determine how variables are used by the host and the device at each program point. It also introduces DCO, a code optimization technique that uses DCA information to: (a) allocate OpenCL shared buffers between host and devices; and (b) insert appropriate OpenCL function calls at program points so as to minimize the number of data coherence operations. We used the AClang compiler to measure the impact of DCA and DCO when generating code from the Parboil, Polybench and Rodinia benchmarks for a set of discrete and integrated GPUs. The experimental results showed speed-ups of up to 5.25x (average of 1.39x) on an ARM Mali-T880 and up to 8.87x (average of 1.66x) on an NVIDIA Pascal Titan X GPU.
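The idea of tracking how each variable is used by host and device, and transferring data only when a stale copy is about to be read, can be illustrated with a small sketch. This is not the DCA/DCO algorithm from the paper: it is a simplified, hypothetical model restricted to straight-line code (no branches or loops, so no fixed-point iteration is needed), and the names `Op` and `plan_transfers` are invented for illustration. It also assumes that every variable initially has its valid copy on the host.

```python
from dataclasses import dataclass

HOST, DEVICE, BOTH = "host", "device", "both"


@dataclass(frozen=True)
class Op:
    """One program point: who executes it and which variables it touches."""
    side: str            # HOST or DEVICE
    reads: tuple = ()
    writes: tuple = ()


def plan_transfers(ops, variables):
    """Return a list of (op_index, variable, direction) transfers.

    A transfer is scheduled just before an operation only when that
    operation reads a variable whose sole valid copy is on the other side.
    """
    # Assumption: all data starts out valid on the host only.
    valid = {v: HOST for v in variables}
    transfers = []
    for i, op in enumerate(ops):
        other = DEVICE if op.side == HOST else HOST
        for v in op.reads:
            if valid[v] == other:
                transfers.append((i, v, f"{other}->{op.side}"))
                valid[v] = BOTH      # both copies are now up to date
        for v in op.writes:
            valid[v] = op.side       # a write invalidates the other copy
    return transfers


ops = [
    Op(HOST, writes=("a",)),                      # host initializes a
    Op(DEVICE, reads=("a",), writes=("b",)),      # kernel 1
    Op(DEVICE, reads=("a", "b"), writes=("b",)),  # kernel 2 reuses a and b
    Op(HOST, reads=("b",)),                       # host consumes the result
]
print(plan_transfers(ops, ["a", "b"]))
# [(1, 'a', 'host->device'), (3, 'b', 'device->host')]
```

In this example only two transfers are planned, because the second kernel finds `a` and `b` already valid on the device. A scheme that copies every kernel argument around every launch would move the same data several more times.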

Journal: Journal of Parallel and Distributed Computing
Publication Date: Apr 10, 2019
Citations: 4

Similar Papers

Data Coherence Analysis and Optimization for Heterogeneous Computing
Rafael Sousa ... Marcio Pereira
01 Oct 2017

Art Installation Design and Algorithm Research Oriented to Heterogeneous Computing Architecture and Particle Swarm Algorithm
Fanyu Meng
IEEE Consumer Electronics Magazine | VOL. 12
01 Mar 2023

Smart Coding using New Code Optimization Techniques in Java to Reduce Runtime Overhead of Java Compiler
Prajakta Gotarane ... Sumedh Pundkar
International Journal of Computer Applications | VOL. 125
17 Sep 2015

Automatically Migrating Sequential Applications to Heterogeneous System Architecture
Chih-Yung Liang ... Wei-Chung Hsu
01 Jul 2018
