GPU-STREAM v2.0: Benchmarking the Achievable Memory Bandwidth of Many-Core Processors Across Diverse Parallel Programming Models

Tom Deakin,Matt Martineau,James Price,Simon Mcintosh-Smith

doi:10.1007/978-3-319-46079-6_34

GPU-STREAM v2.0: Benchmarking the Achievable Memory Bandwidth of Many-Core Processors Across Diverse Parallel Programming Models

Tom Deakin, Matt Martineau + Show 2 more

Open Access

PDF Available

https://doi.org/10.1007/978-3-319-46079-6_34

Copy DOI

Export

Save

Cite

Publication Date: Jan 1, 2016

Citations: 56

Affiliation: University of Bristol

#General Purpose Graphics Processing Units #Peak Memory Bandwidth #Intel Xeon Phi #Arithmetic Logic Units #Memory Bandwidth #Traditional Architectures #General Processing #Many-core Devices #Graphics Processing Units #Peak Memory

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Many scientific codes consist of memory bandwidth bound kernels — the dominating factor of the runtime is the speed at which data can be loaded from memory into the Arithmetic Logic Units, before results are written back to memory. One major advantage of many-core devices such as General Purpose Graphics Processing Units (GPGPUs) and the Intel Xeon Phi is their focus on providing increased memory bandwidth over traditional CPU architectures. However, as with CPUs, this peak memory bandwidth is usually unachievable in practice and so benchmarks are required to measure a practical upper bound on expected performance.

Full Text

Submitted Version (Free)

View/Download pdf

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

GPU-STREAM v2.0: Benchmarking the Achievable Memory Bandwidth of Many-Core Processors Across Diverse Parallel Programming Models