Abstract
Current parallelizing compilers cannot extract a significant fraction of the available parallelism in a loop if it has a complex and/or statically insufficiently defined access pattern. In this paper, a run-time technique based on insp/exec scheme (inspector phase and executor phase) is proposed for finding parallelism in loops. Our inspector can determine the wavefronts of a loop with any complex indirected array indexing pattern by building DEF-USE table. Experimental results show that the new method can handle any complex data dependence pattern that cannot be handled by the previous research, and also reveals that if the input loop workload is not uniform, an appropriate loop scheduling should be considered for achieving high parallelism.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.