Improving Strong-Scaling on GPU Cluster Based on Tightly Coupled Accelerators Architecture

Toshihiro Hanawa,Norihisa Fujita,Tetsuya Odajima,Hisafumi Fujii,Kazuya Matsumoto,Taisuke Boku,Yuetsu Kodama

doi:10.1109/cluster.2015.154

Improving Strong-Scaling on GPU Cluster Based on Tightly Coupled Accelerators Architecture

Toshihiro Hanawa, Norihisa Fujita + Show 5 more

https://doi.org/10.1109/cluster.2015.154

Copy DOI

Publication Date: Sep 1, 2015

Affiliation: The University of Tokyo, University of Tsukuba

#Tightly Coupled Accelerators #GPU Cluster + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The Tightly Coupled Accelerators (TCA) architecture that we proposed in previous work enables direct ommunication between accelerators over nodes. In this paper, we present a proof-of-concept GPU cluster called the HA-PACS/TCA using the PEACH2 chip that we designed as an interconnection router chip based on the TCA architecture. Our system demonstrated 2.0 ?sec of latency on inter-node GPU-to-GPU communication with a PCIe Gen2 x8 by RDMA, reducing minimum latency to just 44% of the InfiniBand-QDR and MPI using GPUDirect for RDMA. Through results of Himeno benchmark tests, we demonstrated that our TCA architecture improved performance scalability with the small-sized problem by up to 61%.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.