Reducing cache misses through programmable decoders

Chuanjun Zhang

doi:10.1145/1328195.1328200

Abstract

Level-one caches normally reside on a processor's critical path, which determines the clock frequency, so fast access to the level-one cache is important. Direct-mapped caches have faster access times but poorer hit rates than same-sized set-associative caches because accesses to the cache sets are nonuniform: some sets incur many misses while others are underutilized. We propose to increase the decoder length and, hence, reduce the accesses to heavily used sets without dynamically detecting cache set usage information. We increase the accesses to the underutilized cache sets by incorporating a replacement policy into the cache design using programmable decoders. On average, the proposed techniques achieve as low a miss rate as a traditional 4-way cache on all 26 SPEC2K benchmarks, for both the instruction and data caches. This translates into an average IPC improvement of 21.5% and 42.4% for the SPEC2K integer and floating-point benchmarks, respectively. The B-Cache consumes 10.5% more power per access but achieves a 12% savings in total memory-access-related energy as a result of the miss rate reductions and, hence, the reduction in applications' execution time. Compared with previous techniques that aim to reduce the miss rate of direct-mapped caches, our technique requires only one cycle to access all cache hits and has the same access time as a direct-mapped cache.
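
The conflict-miss problem that motivates the abstract can be illustrated with a small simulation. The sketch below is not the paper's B-Cache or its programmable decoder; it only compares a direct-mapped cache with a same-sized 4-way set-associative LRU cache on a toy trace. The function name `simulate`, the 32-byte block size, the 8-line capacity, and the trace are all assumptions chosen for illustration.

```python
from collections import OrderedDict


def simulate(trace, num_blocks, ways, block_size=32):
    """Count misses for an LRU cache holding num_blocks lines, grouped into sets of `ways` lines.

    ways=1 models a direct-mapped cache; larger values model set-associative caches.
    All parameters are illustrative assumptions, not values from the paper.
    """
    num_sets = num_blocks // ways
    sets = [OrderedDict() for _ in range(num_sets)]
    misses = 0
    for addr in trace:
        block = addr // block_size          # block address
        lines = sets[block % num_sets]      # set selected by the index bits
        if block in lines:
            lines.move_to_end(block)        # hit: mark as most recently used
        else:
            misses += 1
            if len(lines) == ways:
                lines.popitem(last=False)   # evict the least recently used line
            lines[block] = True
    return misses


# Toy trace: two addresses whose blocks share an index in the direct-mapped
# cache, accessed alternately, so one set is heavily used and the rest idle.
BLOCK_SIZE = 32
NUM_BLOCKS = 8
a = 0
b = NUM_BLOCKS * BLOCK_SIZE
trace = [a, b] * 10

dm_misses = simulate(trace, NUM_BLOCKS, ways=1)
sa_misses = simulate(trace, NUM_BLOCKS, ways=4)
print(dm_misses, sa_misses)  # prints "20 2"
```

In this trace every access to the direct-mapped cache evicts the block needed next, while seven of its eight sets stay unused; the 4-way cache of the same capacity holds both blocks in one set and misses only on the first access to each.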

Journal: ACM Transactions on Architecture and Code Optimization
Publication Date: Jan 1, 2008
Citations: 21

Similar Papers

Balanced Cache
Chuanjun Zhang
ACM SIGARCH Computer Architecture News | VOL. 34
01 May 2006

Memory performance prediction for high-performance microprocessors at deep submicrometer technologies
A Zeng ... K Rose
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 25
01 Sep 2006

High-Performance Data Cache Memory Architecture
Hong-Sik Kim ... Cheong-Ghil Kim
Journal of the Korea Academia-Industrial cooperation Society | VOL. 9
31 Aug 2008

Snug set-associative caches
Jia-Jhe Li ... Yuan-Shin Hwang
01 Jan 2004
