Work Tape Research Articles

Systolic arrays were first introduced by Kung (see, e.g., [2] and [3]) as devices composed of processors of a few different types, which are regularly and locally connected. These processors are activated synchronously by a single clock, which is the only global communication between them. This paper is a continuation of the work presented in [1], where ‘folding’ was proposed as a technique for the design of systolic arrays. Here we first study the power of folding as a general geometric transformation. Then, as an application, we show that two congruent sequences on a ‘regular’ grid can be identified by a limited number of foldings (cf. Theorem 3.5). This result can be used in the design of systolic arrays.

Because of the importance of this motivation, we start with an example of a systolic array which has already been briefly described in [1]. Consider Kung and Leiserson’s hex-connected processor array for matrix multiplication [3, pp. 276-280], modified in such a way that it applies to dense matrices. Fig. 1 illustrates the case where the dimension $n$ of the matrix is equal to 3: the left-to-right and the right-to-left flows correspond to the two matrices $A$ and $B$ to be multiplied, and the bottom-to-top flow to the product $C = A \times B$. Each node represents an inner product step processor, i.e., a processor computing one step of a scalar product: $s \leftarrow s + ab$ (see Fig. 2).

Assume we want to use this array to compute the different powers of a matrix $A$, which basically amounts to computing $A$, $A \times A = A^2$, ..., $A^k \times A^k = A^{2k}$, .... One solution is to iteratively feed the different outputs of step $k$, coming out in the upper part of the array, to both the left and right inputs of step $k + 1$, i.e., to connect each $\gamma_i$ to $\alpha_i$ and $\beta_i$. However, in doing this we would create non-local connections and break the regularity of the layout.
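The iteration just described, in which each product is fed back as both inputs of the next step, can be sketched sequentially as follows. This is only a minimal illustration of the arithmetic: it does not model the hex-connected layout, the data flows, or the timing of the array, and the function names (`inner_product_step`, `multiply`, `repeated_squares`) are hypothetical, not taken from the paper.

```python
def inner_product_step(s, a, b):
    """One step of a scalar product: s <- s + a*b."""
    return s + a * b


def multiply(A, B):
    """Product of two dense n x n matrices, built only from inner product steps."""
    n = len(A)
    C = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            s = 0
            for k in range(n):
                s = inner_product_step(s, A[i][k], B[k][j])
            C[i][j] = s
    return C


def repeated_squares(A, steps):
    """Return [A, A^2, A^4, ...]: each output is fed back as both inputs."""
    powers = [A]
    for _ in range(steps):
        powers.append(multiply(powers[-1], powers[-1]))
    return powers
```

For example, `repeated_squares([[1, 1], [0, 1]], 2)` yields the matrix itself, its square and its fourth power. In a hardware setting, the feedback in `repeated_squares` is exactly the connection from outputs to inputs that the text identifies as non-local.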
Instead, we can first fold the array (as one would fold a sheet of paper) along axis 1, the right-hand side coming on top of the left-hand side. The new array computes the same functions as the original one, very much in the same way as a Turing machine with a one-way infinite working tape can simulate a Turing machine with a two-way infinite working tape, by folding the latter. Then, we can fold the new array along axis 2 (the left-hand side on top of the right-hand side) and again along axis 3 (the right-hand side on top of the left-hand side). The processors $\alpha_i$, $\beta_i$, $\gamma_i$ will eventually occupy the same place, and we must connect them to each other, thus introducing only local connections. As a result, the regularity of the initial layout is preserved. The price to pay for it is that the new array consists of up to $8 = 2^3$ more
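The Turing machine analogy used above can be made concrete with a small sketch. It assumes the textbook two-track construction, in which cell $i$ of a two-way infinite tape is stored at position $i$ on track 0 when $i \ge 0$ and at position $-i - 1$ on track 1 otherwise. The names `fold`, `unfold` and `FoldedTape` are hypothetical and do not come from the paper.

```python
def fold(i):
    """Map cell i of a two-way infinite tape to (position, track) on a one-way tape."""
    return (i, 0) if i >= 0 else (-i - 1, 1)


def unfold(position, track):
    """Inverse of fold: recover the two-way cell index."""
    return position if track == 0 else -position - 1


class FoldedTape:
    """A two-way infinite tape stored on a one-way tape with two tracks per cell."""

    def __init__(self, blank="_"):
        self.blank = blank
        self.cells = []  # cells[position] = [track 0 symbol, track 1 symbol]

    def read(self, i):
        position, track = fold(i)
        if position >= len(self.cells):
            return self.blank
        return self.cells[position][track]

    def write(self, i, symbol):
        position, track = fold(i)
        # Extend the one-way tape to the right only; it never grows to the left.
        while position >= len(self.cells):
            self.cells.append([self.blank, self.blank])
        self.cells[position][track] = symbol
```

Neighbouring cells of the two-way tape land at most one position apart on the folded tape, so a head that moves one cell at a time still moves locally after folding. This is the same locality property the folding of the array is meant to preserve.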

A one-way preset Turing machine with base L is a nondeterministic on-line Turing machine with one working tape preset to a member of L. FINITEREVERSAL(L) (FINITEVISIT(L)) is the class of languages accepted by one-way preset Turing machines with bases in L which are limited to a finite number of reversals (visits). For any full semiAFL L, FINITEREVERSAL(L) is the closure of L under homomorphic replication or, equivalently, the closure of L under iteration of controls on linear context-free grammars, while FINITEVISIT(L) is the result of applying controls from L to absolutely parallel grammars or, equivalently, the closure of L under deterministic two-way finite state transductions. If L is a full AFL with L ≠ FINITEVISIT(L), then FINITEREVERSAL(L) ≠ FINITEVISIT(L). In particular, for one-way checking automata, k + 1 reversals are more powerful than k reversals, k + 1 visits are more powerful than k visits, k visits and k + 1 reversals are incomparable, and there are languages definable within 3 visits but no finite number of reversals. Finite visit one-way checking automaton languages can be accepted by nondeterministic multitape Turing machines in space log 2 n. Results on finite visit checking automata provide another proof that not all context-free languages can be accepted by one-way nonerasing stack automata.

Articles published on Work Tape

A note on some simultaneous relations among time, space, and reversal for single work tape nondeterministic Turing machines

A tradeoff theorem for space and reversal

Folding of the plane and the design of systolic arrays

An improved simulation result for ink-bounded Turing machines

One way finite visit automata

Hierarchies of Turing machines with restricted tape alphabet size

Some Bounds on the Storage Requirements of Sequential Machines and Turing Machines

Design and characteristics of a variable-length record sort using new fixed-length record sorting techniques
