Efficient Hardware Barrier Synchronization in Many-Core CMPs

Jose L Abellan,Juan Fernandez,Manuel E Acacio

doi:10.1109/tpds.2011.304

Abstract

Traditional software-based barrier implementations for shared memory parallel machines tend to produce hotspots in terms of memory and network contention as the number of processors increases. This could limit their applicability to future many-core CMPs in which possibly several dozens of cores would need to be synchronized efficiently. In this work, we develop GBarrier, a hardware-based barrier mechanism especially aimed at providing efficient barriers in future many-core CMPs. Our proposal deploys a dedicated G-line-based network to allow for fast and efficient signaling of barrier arrival and departure. Since GBarrier does not have any influence on the memory system, we avoid all coherence activity and barrier-related network traffic that traditional approaches introduce and that restrict scalability. Through detailed simulations of a 32-core CMP, we compare GBarrier against one of the most efficient software-based barrier implementations for a set of kernels and scientific applications. Evaluation results show average reductions of 54 and 21 percent in execution time, 53 and 18 percent in network traffic, and also 76 and 31 percent in the energy-delay <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> product metric for the full CMP when the kernels and scientific applications, respectively, are considered.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Hardware Barrier Synchronization in Many-Core CMPs

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems

Lead the way for us

Journal: IEEE Transactions on Parallel and Distributed Systems	Publication Date: Aug 1, 2012
Citations: 50

Similar Papers

A G-Line-Based Network for Fast and Efficient Barrier Synchronization in Many-Core CMPs
Jose L Abellan ... Juan Fernandez
-
Jose L Abellan, et. al.Jose L Abellan ... Juan Fernandez
01 Sep 2010
01 Sep 2010

Efficient Self-Invalidation/Self-Downgrade for Critical Sections with Relaxed Semantics
Alberto Ros ... Stefanos Kaxiras
IEEE Transactions on Parallel and Distributed Systems | VOL. 28
Alberto Ros, et. al.Alberto Ros ... Stefanos Kaxiras
01 Dec 2017
IEEE Transactions on Parallel and Distributed Systems | VOL. 28

Redundancy in model specifications for discrete event simulation
Richard E Nance ... C Michael Overstreet
ACM Transactions on Modeling and Computer Simulation | VOL. 9
Richard E Nance, et. al.Richard E Nance ... C Michael Overstreet
01 Jul 1999
ACM Transactions on Modeling and Computer Simulation | VOL. 9

GLocks: Efficient Support for Highly-Contended Locks in Many-Core CMPs
Jose L. Abell´n ... Manuel E. Acacio
-
Jose L. Abell´n, et. al.Jose L. Abell´n ... Manuel E. Acacio
01 May 2011
01 May 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Hardware Barrier Synchronization in Many-Core CMPs

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems