Abstract

A static task clustering scheduling algorithm is proposed for the parallel LU factorization of a sparse linear system on a shared-memory parallel computer. Assuming a column-based medium-grained task model, the parallel LU factorization process is accomplished using a nested bordered block diagonal node reordering technique. In order to minimize the overhead due to task synchronization on a shared-memory parallel machine, it is proposed to regroup tasks into task clusters in which tasks have to be executed sequentially. Both theoretical analysis and actual implementation using benchmark circuits indicate significant speedup compared with the conventional critical path scheduling algorithm. >

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call