Continuous-Flow Matrix Transposition Using Memories

Mario Garrido,Peter Pirsch

doi:10.1109/tcsi.2020.2987736

Abstract

In this paper, we analyze how to calculate the matrix transposition in continuous flow by using a memory or group of memories. The proposed approach studies this problem for specific conditions such as square and non-square matrices, use of limited access memories and use of several memories in parallel. Contrary to previous approaches, which are based on specific cases or examples, the proposed approach derives the fundamental theory involved in the problem of matrix transposition in a continuous flow. This allows for obtaining the exact equations for the read and write addresses of the memories and other control signals in the circuits. Furthermore, the cases that involve non-square matrices, which have not been studied in detail in the literature, are analyzed in depth in this paper. Experimental results show that the proposed approach is capable of transposing matrices of $8192 \times 8192$ 32-bit data received in series at a rate of 200 mega samples per second, which doubles the throughput of previous approaches.

Highlights

M ATRIX transposition is an essential operation in a wide range of signal processing applications
This is due to the fact that it is used for the calculation of multidimensional transforms. This makes it a key component for the 2D fast Fourier transform (FFT) in image processing and machine vision [1], multiple-input multipleoutput (MIMO) [2], [3], automotive [4] and synthetic aperture radars [5]–[7]. It is required for the 3D FFT in molecular dynamics [8], motion detection [9]; for the 2D discrete cosine transform (DCT) in image compression [10], [11]; for the 2D fast Hartley transform (FHT) in image processing and circular convolution [12], [13]; and for the 3D fast Wavelet transform (FWT) in video encoding [14]
Matrix transposition is considered in convolutional neural networks (CNN) [15], [16] for artificial intelligence

Summary

INTRODUCTION

M ATRIX transposition is an essential operation in a wide range of signal processing applications. We provide a detailed analysis of the matrix transposition in a continuous flow using memories under any combination of specific conditions: Square and non-square matrices, use of limited access memories and use of several memories in parallel. For all these cases, efficient solutions that require a total memory size of order O(N) are presented. Whereas previous works study the problem based on examples, the proposed approach deepens in the mathematical and logical fundamentals of the problem This allows for obtaining the exact equations for the read and write addresses of the memories and other control signals.

MATRIX TRANSPOSITION IN A CONTINUOUS FLOW

REVIEW OF BIT-DIMENSION PERMUTATIONS

Bit-Dimension Permutations Using Memories

TRANSPOSITION OF SQUARE MATRICES IN A

TRANSPOSITION OF NON-SQUARE MATRICES IN A CONTINUOUS FLOW

MATRIX TRANSPOSITION USING LIMITED ACCESS MEMORIES

Problem Formulation

Square Matrices

Non-Square Matrices

USING MULTIPLE MEMORIES IN PARALLEL

Specification

EXPERIMENTAL RESULTS

COMPARISON

CONCLUSIONS

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Circuits and Systems I: Regular Papers	Publication Date: Sep 1, 2020
Citations: 30	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Continuous-Flow Matrix Transposition Using Memories

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems I: Regular Papers

Lead the way for us

Similar Papers

A Novel Pipelined Algorithm and Modular Architecture for Non-Square Matrix Transposition
Bo Zhang ... Feng Yu
IEEE Transactions on Circuits and Systems II: Express Briefs | VOL. 68
Bo Zhang, et. al.Bo Zhang ... Feng Yu
05 Nov 2020
IEEE Transactions on Circuits and Systems II: Express Briefs | VOL. 68

Optical neurodevices using a smart detector array
K Kyuma ... J Ohta
-
K Kyuma, et. al.K Kyuma ... J Ohta
01 Jan 1992
01 Jan 1992

Experimental characterization of a silicone oil-in-water droplet generator based on a micro T-junction
B Rostami ... G Puccetti
Journal of Physics: Conference Series | VOL. 796
B Rostami, et. al.B Rostami ... G Puccetti
01 Jan 2017
Journal of Physics: Conference Series | VOL. 796

Learning Dense and Continuous Optical Flow From an Event Camera.
Zhexiong Wan ... Yuchao Dai
IEEE Transactions on Image Processing | VOL. 31
Zhexiong Wan, et. al.Zhexiong Wan ... Yuchao Dai
01 Jan 2021
IEEE Transactions on Image Processing | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Continuous-Flow Matrix Transposition Using Memories

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems I: Regular Papers