A Scalable Synthesis Methodology for Application-Specific Processors

Fei Sun, Anand Raghunathan, Niraj K. Jha, Srivaths Ravi

doi:10.1109/tvlsi.2006.886410

Abstract

Custom processors based on application-specific or domain-specific instruction sets are gaining popularity, and are often used to implement critical architectural blocks in complex systems-on-chip. While several advances have been made in the area of custom processor architectures, tools, and design methodologies, designers are still required to manually perform some critical tasks, such as selecting the custom instructions best suited to the given application and design constraints. We present a scalable methodology for the synthesis of a custom processor from an embedded software program. A key feature of the proposed methodology is its scalability, which is achieved by exploiting the structured, hierarchical nature of large software programs. We motivate the need for such a methodology, and describe the algorithms used for the critical steps, including hardware resource budgeting, local optimizations, and global exploration. Our methodology utilizes the concept of instruction templates, which can be adapted by adding operations to them or deleting operations from them at any time during the design space exploration process, allowing global design decisions to be interleaved with fine-grained optimizations. To the best of our knowledge, this is the first work that uses the program hierarchy to derive soft instruction templates to synthesize application-specific processors for scalable applications. We have integrated our methodology into an open-source compiler, and verified it using a commercial extensible processor. Experiments with several benchmarks indicate that our methodology can effectively tackle large programs. It results in the synthesis of high-quality custom processors that demonstrate an average speedup of 2.82× and a maximum speedup of 6.07×. As a side effect, the processor energy is also reduced. The average and maximum reductions in the energy-delay product for the benchmarks are 7.64× and 18.85×, respectively. The CPU times required for custom processor synthesis are quite small, indicating that the proposed techniques can be applied to embedded software programs of significant complexity.
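The abstract reports both speedup and reduction in the energy-delay product (EDP). As a reminder of how the two metrics relate, the minimal sketch below uses the standard definition EDP = energy × delay, under which the EDP reduction factor equals the speedup multiplied by the energy reduction factor. The function names and all numeric values are hypothetical illustrations and are not taken from the paper.

```python
def edp(energy_joules: float, delay_seconds: float) -> float:
    """Energy-delay product: energy consumed times execution time."""
    return energy_joules * delay_seconds


def edp_reduction(speedup: float, energy_reduction: float) -> float:
    """Factor by which EDP shrinks.

    speedup          = base_delay / custom_delay
    energy_reduction = base_energy / custom_energy
    """
    return speedup * energy_reduction


# Hypothetical measurements for one benchmark (NOT from the paper).
base_energy, base_delay = 8.0, 4.0      # base processor
custom_energy, custom_delay = 4.0, 2.0  # custom processor

speedup = base_delay / custom_delay
energy_reduction = base_energy / custom_energy

# Direct ratio of the two EDP values; it matches the product of the factors.
ratio = edp(base_energy, base_delay) / edp(custom_energy, custom_delay)
print(speedup, energy_reduction, ratio)
```

Because EDP multiplies the two quantities, a design that is both faster and lower in energy improves EDP by more than either factor alone, which is why the reported EDP reductions exceed the reported speedups. Note that averages over benchmarks do not compose this way: the average EDP reduction need not equal the average speedup times the average energy reduction.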

Journal: IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Publication Date: Nov 1, 2006
Citations: 56
