Abstract

The paper considers the mapping of communicating modules of a parallelized task to the processing elements of a parallel computer when precedence relationships among the modules is available. The goal of the mapping is to minimize the total execution time of the task, including both processing and communications time, within a processor network of limited size. This paper presents a method for contracting complete binary precedence trees with n nodes to trees with (n+1)/2 nodes with no increase in execution time. The authors then provide methods for embedding these trees into hypercubes and m-dimensional meshes. When embedded into the hypercube of dimension log(/sup n+1///sub 2/), or into meshes with dimension m>or=(log(/sup n+1///sub 2/))/2, the contracted tree is embedded with unit dilation and with no increase in execution time. For meshes with dimension m >

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call