A multiple floating point coprocessor architecture

Lawrence Rauchwerger, Michael P Farmwald

doi:10.1145/88237.88239

Abstract

General purpose microprocessor-based computers usually speed up their arithmetic processing by using a floating point co-processor. Because adding more co-processors represents neither a technological nor a cost problem, we investigated a system based on a MIPS R2000 [2] and 4 floating point units. In this paper we show a block diagram of such an implementation and how two important scientific operations can be accelerated using a single unmodified data bus. A large percentage of engineering applications are solved with the help of linear algebra methods like BLAS3 [4] algorithms; it is precisely for these primitives that the proposed architecture brings significant performance gains. The first operation described is a matrix multiplication algorithm, together with its timing diagram and some results. Next, a polynomial evaluation technique is examined. Finally, we show how to use the same ideas with various other microprocessors.
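
The abstract does not spell out the matrix multiplication algorithm, so the following is only a minimal sketch of one way four floating point units could share a single data bus. It assumes (this is not taken from the paper) that each unit owns one row of a four-row block of the result and that every word of the second matrix placed on the bus is seen by all units at once. The names `NUM_UNITS` and `matmul_broadcast` are hypothetical.

```python
NUM_UNITS = 4  # the four floating point units mentioned in the abstract


def matmul_broadcast(a, b):
    """Multiply a (n x m) by b (m x p) with a simulated broadcast bus.

    Assumption for illustration only: rows of the result are handled in
    blocks of NUM_UNITS, one row per unit, and each element b[k][j] is
    broadcast once per block so that every unit can use it.
    Returns the product and the number of broadcast bus words.
    Loads of `a` and stores of the result are not counted.
    """
    n, m, p = len(a), len(b), len(b[0])
    c = [[0.0] * p for _ in range(n)]
    broadcasts = 0
    for row0 in range(0, n, NUM_UNITS):
        # Rows assigned to the units in this block (the last block may be short).
        rows = range(row0, min(row0 + NUM_UNITS, n))
        for j in range(p):
            for k in range(m):
                word = b[k][j]      # one word placed on the shared bus
                broadcasts += 1
                for i in rows:      # every unit does one multiply-add with it
                    c[i][j] += a[i][k] * word
    return c, broadcasts


product, bus_words = matmul_broadcast([[1, 2], [3, 4]], [[5, 6], [7, 8]])
print(product, bus_words)
```

Under these assumptions, one broadcast word feeds up to four multiply-add operations, which is the kind of bus reuse that would let extra coprocessors help without widening the bus.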

Journal: ACM SIGARCH Computer Architecture News
Publication Date: May 1, 1990
Citations: 1

Similar Papers

A multiple floating point coprocessor architecture
...
30 Nov 1990

A multiple floating point coprocessor architecture
L Rauchwerger ... P.M Farmwald
04 Dec 2002

Scalable and modular algorithms for floating-point matrix multiplication on FPGAs
Ling Zhuo ... V.K Prasanna
26 Apr 2004

A 4.5 mm² multiplier array for a 200 MFLOP pipelined coprocessor
C Heikes
16 Feb 1994
