A Cost-Effective Architecture for Vectorizable Numerical and Multimedia Applications

Francisca Quintana, Mateo Valero, Roger Espasa, Jesus Corbal

doi:10.1007/s00224-003-1088-4

Abstract

This paper analyzes the performance of vector-dominated regions of code in numerical and multimedia applications in a superscalar + vector architecture and compares it with an eight-way superscalar processor. The ability to split a program’s execution into scalar and vector regions allows us to show that (1) as expected, the vector unit is much better than the wide-issue superscalar at executing the vector-dominated regions of the code; (2) on the scalar regions, the eight-way superscalar, although better than a four-way superscalar, is clearly not worth the extra complexity in terms of extra transistors and potential cycle-time limitations. Overall, the vector-enhanced superscalar is from 6% to 303% better than an eight-way superscalar. We also present detailed data on the performance of the memory system, which is usually the key limiting factor when running numerical and multimedia applications. We evaluate two additional cache designs that try to alleviate problems created by non-unit stride memory references.

Journal: Theory of Computing Systems
Publication Date: Jul 25, 2003
Citations: 14

