Efficient Instruction Scheduling Using Real-time Load Delay Tracking

Andreas Diavastos, Trevor E Carlson

doi:10.1145/3548681

Abstract

Issue time prediction processors use dataflow dependencies and predefined instruction latencies to predict issue times of repeated instructions. In this work, we make two key observations: (1) memory accesses often take longer than the static, predefined access latency assumed in these systems, because of contention in the memory hierarchy and variability in DRAM access times, and (2) these memory access delays often repeat across iterations of the same code. We propose a new processor microarchitecture that replaces a complex reservation-station-based scheduler with an efficient, scalable alternative. Our scheduling technique tracks real-time delays of loads to accurately predict instruction issue times and uses a reordering mechanism to prioritize instructions based on that prediction. To accomplish this in an energy-efficient manner, we introduce (1) an instruction delay learning mechanism that monitors repeated load instructions and learns their latest delay, (2) an issue time predictor that uses learned delays and dataflow dependencies to predict instruction issue times, and (3) priority queues that reorder instructions based on their issue time prediction. Our processor achieves 86.2% of the performance of a traditional out-of-order processor, higher than previous efficient scheduler proposals, while consuming 30% less power.
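The three mechanisms named in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the authors' hardware design: the names `Instr`, `DelayTable`, `predict_issue_times`, and `issue_order`, the latency values in `STATIC_LATENCY`, and the use of a single heap in place of the paper's priority queues are all hypothetical choices made for illustration.

```python
import heapq
from dataclasses import dataclass
from typing import Optional, Tuple

# Assumed static latencies in cycles (illustrative values, not from the paper).
STATIC_LATENCY = {"load": 4, "alu": 1}

@dataclass(frozen=True)
class Instr:
    pc: int
    dst: Optional[str]
    srcs: Tuple[str, ...]
    is_load: bool = False

class DelayTable:
    """Remembers, per load PC, the latest delay seen beyond the static latency."""
    def __init__(self):
        self._latest = {}

    def learn(self, pc, observed_latency, static_latency):
        # Keep only the most recent observation, as "learns their latest delay" suggests.
        self._latest[pc] = max(0, observed_latency - static_latency)

    def lookup(self, pc):
        return self._latest.get(pc, 0)  # unseen loads are assumed to have no extra delay

def predict_issue_times(instrs, delays, now=0):
    """Walk instructions in program order and predict when each can issue."""
    ready = {}  # register name -> cycle at which its value is predicted to be available
    predictions = []
    for ins in instrs:
        issue = max([now] + [ready.get(r, now) for r in ins.srcs])
        latency = STATIC_LATENCY["load" if ins.is_load else "alu"]
        if ins.is_load:
            latency += delays.lookup(ins.pc)  # add the learned extra delay
        if ins.dst is not None:
            ready[ins.dst] = issue + latency
        predictions.append((issue, ins))
    return predictions

def issue_order(predictions):
    """Reorder by predicted issue time; ties fall back to program order."""
    heap = [(t, seq, ins.pc) for seq, (t, ins) in enumerate(predictions)]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[2] for _ in range(len(heap))]

program = [
    Instr(pc=0x10, dst="r1", srcs=("r0",), is_load=True),  # load r1 <- [r0]
    Instr(pc=0x14, dst="r2", srcs=("r1",)),                # depends on the load
    Instr(pc=0x18, dst="r3", srcs=("r0",)),                # independent of the load
]
delays = DelayTable()
before = issue_order(predict_issue_times(program, delays))

# Suppose the load at 0x10 was observed to take 20 cycles on a previous iteration.
delays.learn(pc=0x10, observed_latency=20, static_latency=STATIC_LATENCY["load"])
after = predict_issue_times(program, delays)
```

In this toy model, learning a 16-cycle extra delay for the load pushes the predicted issue time of its dependent instruction from cycle 4 to cycle 20, while the independent instruction is still predicted to issue at cycle 0 and is ordered ahead of it.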

Journal: ACM Transactions on Computer Systems
Publication Date: Nov 24, 2022
Citations: 3

Similar Papers

Impact Analysis On A Memory Hierarchy Applied To IPNoSys Architecture
Alexandro Lima Damasceno ... Gustavo Girao Barreto Da Silva
IEEE Latin America Transactions | VOL. 15
01 Apr 2017

Distributed prefetch-buffer/cache design for high performance memory systems
T Alexander ... G Kedem
03 Feb 1996

Memory access optimization through combined code scheduling, memory allocation, and array binding in embedded system design
Jungeun Kim ... Taewhan Kim
01 Jan 2004

Data Cache Prefetching Using a Global History Buffer
K.J Nesbit ... J.E Smith
IEEE Micro | VOL. 25
01 Jan 2004
