Architectures and Execution Models for Hardware/Software Compilation and Their System-Level Realization

Holger Lange, Andreas Koch

doi:10.1109/tc.2009.180

Abstract

We propose an execution model that orchestrates the fine-grained interaction of a conventional general-purpose processor (GPP) and a high-speed reconfigurable hardware accelerator (HA), the latter having full master-mode access to memory. We then describe how the resulting requirements can actually be realized efficiently in a custom computer by hardware architecture and system software measures. One of these is a low-latency HA-to-GPP signaling scheme with latency up to 23 times shorter than that of conventional approaches. Another is a high-bandwidth shared memory interface that does not interfere with time-critical operating system functions executing on the GPP, and still makes 89 percent of the physical memory bandwidth available to the HA. Finally, we show two schemes with different flexibility/performance trade-offs for running the HA in protected virtual memory scenarios. All of the techniques and their interactions are evaluated at the system level using the full-scale virtual memory variant of the Linux operating system on actual hardware.

Journal: IEEE Transactions on Computers
Publication Date: Oct 1, 2010
Citations: 49

Similar Papers

Hardware Accelerated Mappers for Hadoop MapReduce Streaming
Katayoun Neshatpour ... Houman Homayoun
IEEE Transactions on Multi-Scale Computing Systems | VOL. 4
01 Oct 2018

Integrated Hardware Architecture for Efficient Computation of the $n$-Best Bio-Sequence Local Alignments in Embedded Platforms
Nuno Sebastiao ... Paulo Flores
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 20
01 Jul 2012

Unifying software and hardware of multithreaded reconfigurable applications within operating system processes
01 Jan 2006

Review of ASIC accelerators for deep neural network
Raju Machupalli ... Mrinal Mandal
Microprocessors and Microsystems | VOL. 89
12 Jan 2022