A comparative analysis of cache designs for vector processing

Tong Sun Tong Sun,Qing Yang Qing Yang

doi:10.1109/12.754999

Abstract

This paper presents an experimental study on cache memory designs for vector computers. We use an execution-driven simulator to evaluate vector cache performance of a set of application programs from Perfect Club and SPEC92 benchmark suites. Our simulation results uncover a few important facts which were unknown before: First of all, the prime-mapped cache that we newly proposed shows great performance potential in vector processing environment. Because of its conflict-free property, the prime-mapped cache performs significantly better than conventional cache designs for all applications considered. Second, performance results on the benchmarks indicate that data locality in vector processing does exist, although the effects of line size, associativity, replacement algorithm, and prefetching scheme on cache performance are very different from what has been commonly believed. A medium size vector cache (e.g., 128 Kbytes) eliminates the necessity for a large number of interleaved memory banks in vector computers. Our experiments show that the vector computer that has a medium size prime-mapped cache with small cache line size and limited amount of prefetching provides significant speedup over conventional vector computers without cache. Performance results reported in this paper can also provide guidance to general-purpose computer designers to enhance cache performance for numerical applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A comparative analysis of cache designs for vector processing

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers

Lead the way for us

Journal: IEEE Transactions on Computers	Publication Date: Mar 1, 1999
Citations: 17

Similar Papers

Characteristics of an On-Chip Cache on NEC SX Vector Architecture
Akihiro Musa ...
Interdisciplinary Information Sciences | VOL. 15
Akihiro Musa, et. al.Akihiro Musa ...
01 Jan 2009
Interdisciplinary Information Sciences | VOL. 15

Effects of MSHR and Prefetch Mechanisms on an On-Chip Cache of the Vector Architecture
Akihoro Musa ... Hiroyuki Takizawa
-
Akihoro Musa, et. al.Akihoro Musa ... Hiroyuki Takizawa
01 Dec 2008
01 Dec 2008

Cache partitioning strategies for 3-D stacked vector processors
Yusuke Funaya ... Hiroaki Kobayashi
-
Yusuke Funaya, et. al.Yusuke Funaya ... Hiroaki Kobayashi
01 Nov 2010
01 Nov 2010

Review of general and Toeplitz vector bidiagonal solvers
Josep-Lulis Larriba-Pey ... Oriol Roig
Parallel Computing | VOL. 22
Josep-Lulis Larriba-Pey, et. al.Josep-Lulis Larriba-Pey ... Oriol Roig
01 Oct 1996
Parallel Computing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A comparative analysis of cache designs for vector processing

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers