Portable inter-workgroup barrier synchronisation for GPUs

Tyler Sorensen, Ganesh Gopalakrishnan, Mark Batty, Alastair F. Donaldson, Zvonimir Rakamarić

doi:10.1145/2983990.2984032

Open Access

Publication Date: Oct 19, 2016

Citations: 45

Affiliation: Imperial College London, University of Utah, University of Kent

#Atomic Operations #Traditional Barriers

Abstract

Despite the growing popularity of GPGPU programming, there is not yet a portable and formally-specified barrier that one can use to synchronise across workgroups. Moreover, the occupancy-bound execution model of GPUs breaks assumptions inherent in traditional software execution barriers, exposing them to deadlock. We present an occupancy discovery protocol that dynamically discovers a safe estimate of the occupancy for a given GPU and kernel, allowing for a starvation-free (and hence, deadlock-free) inter-workgroup barrier by restricting the number of workgroups according to this estimate. We implement this idea by adapting an existing, previously non-portable, GPU inter-workgroup barrier to use OpenCL 2.0 atomic operations, and prove that the barrier meets its natural specification in terms of synchronisation.
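The interplay between occupancy-bound scheduling and the discovery protocol can be illustrated with a small CPU-thread simulation. This is a minimal sketch, not the authors' OpenCL 2.0 implementation: the function `run_kernel`, the semaphore that models occupancy, and the poll/ticket scheme are all assumptions made for illustration. Each simulated workgroup joins a poll only if it is actually scheduled while the poll is open; the first participant to move on closes the poll, and the barrier then synchronises only the workgroups that joined.

```python
import threading


def run_kernel(num_workgroups, occupancy):
    """Simulate a kernel launch on a device that keeps at most `occupancy`
    workgroups resident at once (hypothetical model, not real GPU code).

    Returns the sorted ids of the workgroups that joined the poll and
    passed the barrier.
    """
    slots = threading.Semaphore(occupancy)  # models occupancy-bound scheduling
    mutex = threading.Lock()                # protects the poll state
    state = {"poll_open": True, "count": 0}
    barrier_cv = threading.Condition()
    arrived = [0]
    passed = []

    def workgroup():
        # A workgroup only runs once it holds a slot, and it keeps the
        # slot until it exits (no pre-emption, as assumed for GPUs here).
        with slots:
            # Phase 1: take a ticket if the poll is still open.
            with mutex:
                if not state["poll_open"]:
                    return  # scheduled too late: do not participate
                my_id = state["count"]
                state["count"] += 1
            # Phase 2: close the poll; the count is final from here on.
            with mutex:
                state["poll_open"] = False
                total = state["count"]
            # Barrier restricted to the discovered participants.
            with barrier_cv:
                arrived[0] += 1
                barrier_cv.notify_all()
                barrier_cv.wait_for(lambda: arrived[0] == total)
                passed.append(my_id)

    threads = [threading.Thread(target=workgroup) for _ in range(num_workgroups)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sorted(passed)


if __name__ == "__main__":
    ids = run_kernel(num_workgroups=16, occupancy=4)
    print("participating workgroups:", ids)
```

Because every participant took its ticket while holding a slot and releases that slot only after the barrier, the number of participants can never exceed the simulated occupancy, so every workgroup the barrier waits for is guaranteed to be running. A barrier that instead waited for all 16 launched workgroups would hang in this model, since the 12 that are not resident cannot start until a resident one exits.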
