Abstract

In this paper a vectorized algorithm for simultaneously computing up to eight singular value decompositions (SVDs, each of the form [Formula: see text]) of real or complex matrices of order two is proposed. The algorithm extends to a batch of matrices of an arbitrary length [Formula: see text], that arises, for example, in the annihilation part of the parallel Kogbetliantz algorithm for the SVD of matrices of order [Formula: see text]. The SVD method for a single matrix of order two is derived first. It scales, in most instances error-free, the input matrix [Formula: see text] such that the scaled singular values cannot overflow whenever the elements of [Formula: see text] are finite, and then computes the URV factorization of the scaled matrix, followed by the SVD of the non-negative upper-triangular middle factor. A vector-friendly data layout for the batch is then introduced, where the same-indexed elements of each of the input and the output matrices form vectors, and the algorithm’s steps over such vectors are described. The vectorized approach is shown to be about three times faster than processing each matrix in the batch separately, while slightly improving accuracy over the straightforward method for the [Formula: see text] SVD.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.