A Novel Fully Hardware-Implemented SVD Solver Based on Ultra-Parallel BCV Jacobi Algorithm

Tang Hu,Songnan Ren,Li Yan,Xiangdi Li,Zhiwei Xu,Xiao Yu,Xuyang Bai,Shiqiang Zhu

doi:10.1109/tcsii.2022.3200750

Abstract

Efficient FPGA-based floating-point singular value decomposition (SVD) is challenging for its enormous complexity with the rapid growth of the matrix dimension. Numerous hardware architectures have been proposed to improve the performance of SVD by increasing capacity of computation units, reusing data, and enhancing bandwidth. These designs, however, are not optimum due to their low parallelism, poor data access efficiency, and inferior iterations scheduling. In this express, we propose a block column vector Hestenes-Jacobi (BCV Jacobi) algorithm that decomposes an arbitrary large matrix into several blocks, enhances the access efficiency by customizing the distinctive data structure, and improves the system-level parallelism by simplifying the iteration scheduling. The proposed BCV Jacobi algorithm also achieves better scalability and efficiency. Experimental results show that the performance of the proposed FPGA based SVD processor is superior to other SVD implementations in terms of parallelism, data access efficiency, matrix size, and execution time. When compared with state of the art SVD accelerator engine, the proposed algorithm speeds up the runtime over <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$2{\times }$ </tex-math></inline-formula> on average.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Fully Hardware-Implemented SVD Solver Based on Ultra-Parallel BCV Jacobi Algorithm

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems II: Express Briefs

Lead the way for us

Similar Papers

A pipeline VLSI design of fast singular value decomposition processor for real-time EEG system based on on-line recursive independent component analysis
Kuan-Ju Huang ... Chih Wei Feng
-
Kuan-Ju Huang, et. al. Kuan-Ju Huang ... Chih Wei Feng
01 Jul 2013
01 Jul 2013

A parallel VLSI architecture of singular value decomposition processor for real-time multi-channel EEG system
Kuan-Ju Huang ... Jui-Chung Chang
-
Kuan-Ju Huang, et. al.Kuan-Ju Huang ... Jui-Chung Chang
01 Jun 2013
01 Jun 2013

Modal Analysis of Fluid Flows: An Overview
Kunihiko Taira ... Lawrence S Ukeiley
AIAA Journal | VOL. 55
Kunihiko Taira, et. al.Kunihiko Taira ... Lawrence S Ukeiley
31 Oct 2017
AIAA Journal | VOL. 55

A VLSI design of singular value decomposition processor used in real-time ICA computation for multi-channel EEG system
Kuan-Ju Huang ... Wei-Yeh Shih
-
Kuan-Ju Huang, et. al. Kuan-Ju Huang ... Wei-Yeh Shih
01 May 2013
01 May 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Fully Hardware-Implemented SVD Solver Based on Ultra-Parallel BCV Jacobi Algorithm

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems II: Express Briefs