Designing Area and Performance Constrained SIMD/VLIW Image Processing Architectures

Hamed Fatemi, Pieter Jonker, Richard Kleihorst, Twan Basten, Henk Corporaal

doi:10.1007/11558484_87

Abstract

Image processing is widely used in many applications, including medical imaging, industrial manufacturing, and security systems. In these applications, images are often very large, processing time must be short, and real-time constraints must be met. Consequently, over the last decades there has been an increasing demand to exploit parallelism in applications. Parallelism can be explored along three axes: data-level parallelism (DLP), instruction-level parallelism (ILP), and task-level parallelism (TLP). This paper explores the limitations and bottlenecks of increasing support for parallelism along the DLP and ILP axes, both in isolation and in combination. To scrutinize the effect of DLP and ILP in our architecture template, we define an area model based on the number of ALUs (ILP) and the number of processing elements (DLP) in the template, as well as a performance model. Based on these models and the template, a set of image processing kernels is studied to find Pareto-optimal architectures in terms of area and number of cycles via multi-objective optimization.
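
The idea of searching for Pareto-optimal designs in terms of area and cycle count can be illustrated with a minimal sketch. The abstract does not give the actual models, so everything below is an assumption made for illustration: the linear `area` function, the `cycles` function with a saturating ILP limit, the constants, and the candidate values for the number of processing elements and ALUs are all hypothetical and are not taken from the paper.

```python
from itertools import product


def area(n_pe, n_alu, pe_base=1.0, alu_cost=0.5):
    """Hypothetical area model: each PE has a fixed base cost plus a cost per ALU."""
    return n_pe * (pe_base + alu_cost * n_alu)


def cycles(n_pe, n_alu, work=4096, ilp_limit=4):
    """Hypothetical performance model: work is split across PEs (DLP) and
    ALUs (ILP), with usable ILP saturating at ilp_limit."""
    effective_ilp = min(n_alu, ilp_limit)
    return -(-work // (n_pe * effective_ilp))  # ceiling division


def pareto_front(points):
    """Return the points not dominated in (area, cycles); both are minimized."""
    front = []
    for p in points:
        dominated = any(
            q[0] <= p[0] and q[1] <= p[1] and (q[0] < p[0] or q[1] < p[1])
            for q in points
        )
        if not dominated:
            front.append(p)
    return sorted(front)


# Enumerate a small, assumed design space: (area, cycles, n_pe, n_alu).
designs = [
    (area(pe, alu), cycles(pe, alu), pe, alu)
    for pe, alu in product([1, 2, 4, 8, 16], [1, 2, 4, 8])
]
front = pareto_front(designs)

for a, c, pe, alu in front:
    print(f"PEs={pe:2d} ALUs={alu} area={a:5.1f} cycles={c}")
```

Under these assumed models, designs with more ALUs than the ILP limit pay extra area without reducing cycles, so they drop out of the front. A real exploration would substitute the paper's own area and performance models and measured kernel data.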

Similar Papers

An embedded co-processor architecture for energy-efficient stream computing
Amrit Panda ... Karam S Chatha
01 Oct 2014

Parallel Image Processing Concepts
Sukhbeer Singh ... Sarbjeet Singh
International Journal of Computer and Communication Technology
01 Jan 2010

Targeting code diversity with run-time adjustable issue-slots in a chip multiprocessor
F Anjam ... M Nadeem
01 Mar 2011

A multi-threaded coarse-grained array processor for wireless baseband
Tom Vander Aa ... Martin Palkovic
01 Jun 2011