CPU cache prefetching: Timing evaluation of hardware implementations

J Tse,A.J Smith

doi:10.1109/12.677225

Abstract

Prefetching into CPU caches has long been known to be effective in reducing the cache miss ratio, but known implementations of prefetching have been unsuccessful in improving CPU performance. The reasons for this are that prefetches interfere with normal cache operations by making cache address and data ports busy, the memory bus busy, the memory banks busy, and by not necessarily being complete by the time that the prefetched data is actually referenced. In this paper, we present extensive quantitative results of a detailed cycle-by-cycle trace-driven simulation of a uniprocessor memory system in which we vary most of the relevant parameters in order to determine when and if hardware prefetching is useful. We find that, in order for prefetching to actually improve performance, the address array needs to be double ported and the data array needs to either be double ported or fully buffered. It is also very helpful for the bus to be very wide (e.g., 16 bytes) for bus transactions to be split and for main memory to be interleaved. Under the best circumstances, i.e., with a significant investment in extra hardware, prefetching can significantly improve performance. For implementations without adequate hardware, prefetching often decreases performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CPU cache prefetching: Timing evaluation of hardware implementations

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers

Lead the way for us

Journal: IEEE Transactions on Computers	Publication Date: May 1, 1998
Citations: 62

Similar Papers

Migration and Tuning of Software Prefetching for Sunway Multi-Core Processor
Xiuwu Gao ... Hongmei Wei
-
Xiuwu Gao, et. al.Xiuwu Gao ... Hongmei Wei
15 Apr 2022
15 Apr 2022

Interplay between hardware prefetcher and page eviction policy in CPU-GPU unified virtual memory
Debashis Ganguly ... Ziyu Zhang
-
Debashis Ganguly, et. al.Debashis Ganguly ... Ziyu Zhang
22 Jun 2019
22 Jun 2019

Evaluation of Hardware Data Prefetchers on Server Processors
Mohammad Bakhshalipour ... Hamid Sarbazi-Azad
ACM Computing Surveys | VOL. 52
Mohammad Bakhshalipour, et. al.Mohammad Bakhshalipour ... Hamid Sarbazi-Azad
18 Jun 2019
ACM Computing Surveys | VOL. 52

Hybrid analytical modeling of pending cache hits, data prefetching, and MSHRs
Xi E Chen ... Tor M Aamodt
-
Xi E Chen, et. al.Xi E Chen ... Tor M Aamodt
01 Nov 2008
01 Nov 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CPU cache prefetching: Timing evaluation of hardware implementations

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers