New serial and parallel recursive QR factorization algorithms for SMP systems

Erik Elmroth, Fred Gustavson

doi:10.1007/bfb0095328

Publication Date: Jan 1, 1998

Citations: 52

Affiliation: Umeå University, IBM Research - Thomas J. Watson Research Center

#Recursive Algorithm #Uniprocessor Algorithm

Abstract

We present a new recursive algorithm for the QR factorization of an m by n matrix A. The recursion leads to an automatic variable blocking that allows us to replace a level 2 part in a standard block algorithm by level 3 operations. However, there are some additional costs for performing the updates, which prohibit the efficient use of the recursion for large n. This obstacle is overcome by using a hybrid recursive algorithm that outperforms the LAPACK algorithm DGEQRF by 78% to 21% as m = n increases from 100 to 1000. A successful parallel implementation, based on dynamic load balancing, is presented for a PowerPC 604-based IBM SMP node. For 2, 3, and 4 processors and m = n = 2000, it shows speedups of 1.96, 2.99, and 3.92, respectively, compared to our uniprocessor algorithm.
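
The recursive splitting described in the abstract can be illustrated with a minimal sketch. It assumes the compact WY form Q = I - Y T Y^T for the accumulated Householder reflectors, which the abstract does not spell out. The function names (`recursive_qr`, `householder`, `form_q`) and the pure-Python list-of-lists matrices are illustrative choices, not the authors' implementation. The sketch covers only the plain recursion on column halves (with m >= n), not the hybrid variant or the parallel version.

```python
import math

def matmul(A, B):
    """Dense matrix product of two list-of-lists matrices."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def sub(A, B):
    return [[a - b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def householder(col):
    """Return (v, tau, beta) with v[0] = 1 so that (I - tau v v^T) col = beta e1."""
    alpha = col[0]
    xnorm = math.sqrt(sum(x * x for x in col[1:]))
    if xnorm == 0.0:
        # Column is already a multiple of e1: use the identity reflector.
        return [1.0] + [0.0] * (len(col) - 1), 0.0, alpha
    beta = -math.copysign(math.hypot(alpha, xnorm), alpha)
    tau = (beta - alpha) / beta
    v = [1.0] + [x / (alpha - beta) for x in col[1:]]
    return v, tau, beta

def recursive_qr(A):
    """Factor an m x n matrix (m >= n) as A = Q [R; 0] with Q = I - Y T Y^T.

    Returns Y (m x n, unit lower trapezoidal), T (n x n, upper triangular)
    and R (n x n, upper triangular).
    """
    m, n = len(A), len(A[0])
    if n == 1:
        # Base case: a single Householder reflector.
        v, tau, beta = householder([row[0] for row in A])
        return [[x] for x in v], [[tau]], [[beta]]
    n1 = n // 2
    A1 = [row[:n1] for row in A]
    A2 = [row[n1:] for row in A]
    # Factor the left half recursively.
    Y1, T1, R1 = recursive_qr(A1)
    # Update the right half: A2 <- Q1^T A2 = A2 - Y1 T1^T (Y1^T A2).
    W = matmul(transpose(T1), matmul(transpose(Y1), A2))
    A2 = sub(A2, matmul(Y1, W))
    R12 = A2[:n1]
    # Factor the trailing block of the updated right half recursively.
    Y2_low, T2, R2 = recursive_qr(A2[n1:])
    Y2 = [[0.0] * (n - n1) for _ in range(n1)] + Y2_low
    # Couple the two halves: T3 = -T1 (Y1^T Y2) T2.
    P = matmul(T1, matmul(matmul(transpose(Y1), Y2), T2))
    T3 = [[-x for x in row] for row in P]
    Y = [r1 + r2 for r1, r2 in zip(Y1, Y2)]
    T = [t1 + t3 for t1, t3 in zip(T1, T3)] + [[0.0] * n1 + t2 for t2 in T2]
    R = [r1 + r12 for r1, r12 in zip(R1, R12)] + [[0.0] * n1 + r2 for r2 in R2]
    return Y, T, R

def form_q(Y, T):
    """Build the explicit m x m orthogonal factor Q = I - Y T Y^T."""
    m = len(Y)
    identity = [[float(i == j) for j in range(m)] for i in range(m)]
    return sub(identity, matmul(matmul(Y, T), transpose(Y)))

# Usage: factor a small 4 x 3 matrix and rebuild it from Q and R.
A = [[2.0, -1.0, 3.0], [1.0, 4.0, 0.5], [0.0, 2.0, -2.0], [3.0, 1.0, 1.0]]
Y, T, R = recursive_qr(A)
Q = form_q(Y, T)
R_padded = R + [[0.0] * 3 for _ in range(len(A) - 3)]  # pad R to m x n
A_rebuilt = matmul(Q, R_padded)  # should match A up to rounding error
```

In this sketch the update of the right half and the formation of T3 are matrix-matrix products, which is where a tuned implementation would call level 3 operations. The extra work spent forming T at every level of the recursion is one plausible reading of the "additional costs" the abstract mentions for large n.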
