Abstract
This paper proves matching upper and lower bounds for broadcast on general-purpose parallel computation models that exploit network locality. These models aim to combine the general-purpose properties of models like the PRAM or BSP with the network locality exploited by special-purpose models such as meshes and hypercubes. They do so by charging a cost $l(|i-j|)$ for a communication between processors $i$ and $j$, where $l$ is a suitably chosen latency function. An upper bound $T(p) = \sum_{i=0}^{\log\log p} 2^i \cdot l(p^{1/2^i})$ on the runtime of a broadcast on a $p$-processor H-PRAM is given for an arbitrary latency function $l(k)$. The main contribution of the paper is a matching lower bound, which holds for all latency functions in the range from $l(k) = \Omega(\log k / \log\log k)$ to $l(k) = O(\log^2 k)$. This is not a severe restriction: for latency functions $l(k) = O(\log k / \log^{1+\varepsilon}\log k)$ with arbitrary $\varepsilon > 0$, the runtime of the algorithm matches the trivial lower bound $\Omega(\log p)$, and for $l(k) = \Theta(\log^{1+\varepsilon} k)$ or $l(k) = \Theta(k^{\varepsilon})$, the runtime matches the other trivial lower bound $\Omega(l(p))$. Both the upper and the lower bound also apply to other parallel locality models such as Y-PRAM, D-BSP and E-BSP.
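To make the upper bound concrete, the following minimal sketch evaluates the sum $T(p)$ numerically for a given latency function. It is an illustration only, not the paper's algorithm: the function name `broadcast_upper_bound`, the use of base-2 logarithms, and rounding $\log\log p$ down to an integer are assumptions that the abstract does not specify.

```python
import math


def broadcast_upper_bound(p, latency):
    """Evaluate T(p) = sum_{i=0}^{log log p} 2^i * l(p^(1/2^i)).

    Assumptions (not stated in the abstract): logarithms are base 2,
    log log p is rounded down, and p >= 2.
    """
    if p < 2:
        raise ValueError("p must be at least 2")
    # Upper summation index: floor(log2(log2(p))).
    levels = math.floor(math.log2(math.log2(p)))
    total = 0.0
    for i in range(levels + 1):
        # Distance argument of term i: p^(1/2^i).
        k = p ** (1.0 / 2 ** i)
        # Term i contributes 2^i times the latency at distance k.
        total += (2 ** i) * latency(k)
    return total


if __name__ == "__main__":
    p = 2 ** 16
    # Logarithmic latency, which lies inside the range covered by the lower bound.
    print(broadcast_upper_bound(p, math.log2))
    # Constant latency, for comparison: the sum reduces to 2^0 + ... + 2^(log log p).
    print(broadcast_upper_bound(p, lambda k: 1.0))
```

For $l(k) = \log k$, every term equals $2^i \cdot \log(p)/2^i = \log p$, so when $\log\log p$ is an integer the sum is $\log p \cdot (\log\log p + 1)$. This falls between the two trivial lower bounds, which is the regime where the matching lower bound is needed.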