A comparison of sorting algorithms for the Connection Machine CM-2

Guy E Blelloch, C Greg Plaxton, Bruce M Maggs, Marco Zagha, Stephen J Smith, Charles E Leiserson

doi:10.1145/113379.113380

Abstract

Sorting is arguably the most studied problem in computer science, both because it is used as a substep in many applications and because it is a simple, combinatorial problem with many interesting and diverse solutions. Sorting is also an important benchmark for parallel supercomputers. It requires significant communication bandwidth among processors, unlike many other supercomputer benchmarks, and the most efficient sorting algorithms communicate data in irregular patterns.

Parallel algorithms for sorting have been studied since at least the 1960s. An early advance in parallel sorting came in 1968 when Batcher discovered the elegant Θ(lg² n)-depth bitonic sorting network [3]. For certain families of fixed interconnection networks, such as the hypercube and shuffle-exchange, Batcher’s bitonic sorting technique provides a parallel algorithm for sorting n numbers in Θ(lg² n) time with n processors. The question of the existence of an o(lg² n)-depth sorting network remained open until 1983, when Ajtai, Komlós, and Szemerédi [1] provided an optimal Θ(lg n)-depth sorting network, but unfortunately, their construction leads to larger networks than those given by bitonic sort for all “practical” values of n. Leighton [15] has shown that any Θ(lg n)-depth family of sorting networks can be used to sort n numbers in Θ(lg n) time in the bounded-degree fixed interconnection network domain. Not surprisingly, the optimal Θ(lg n)-time fixed interconnection sorting networks implied by the AKS construction are also impractical. In 1983, Reif and Valiant proposed a more practical O(lg n)-time randomized algorithm for sorting [19], called flashsort. Many other parallel sorting algorithms have been proposed in the literature, including parallel versions of radix sort and quicksort [5], a variant of quicksort called hyperquicksort [23], smoothsort [18], column sort [15], Nassimi and Sahni’s sort [17], and parallel merge sort [6].
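
As a rough illustration of the bitonic sorting network mentioned above, the following Python sketch simulates the network sequentially and counts its compare-exchange stages. It is not the CM-2 implementation from the paper: the function name `bitonic_sort` and the returned stage counter are illustrative assumptions, and the input length is assumed to be a power of two. Within each stage every element is paired with exactly one partner, so with n processors a stage would take one parallel step.

```python
def bitonic_sort(values):
    """Sort a power-of-two-length list with a bitonic network.

    Returns (sorted_list, depth), where depth is the number of
    compare-exchange stages, i.e. the parallel depth of the network.
    Illustrative sketch only; names are not taken from the paper.
    """
    a = list(values)
    n = len(a)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a positive power of two")

    depth = 0
    k = 2
    while k <= n:              # k: length of the sorted runs produced by this merge phase
        j = k // 2
        while j >= 1:          # j: distance between compare-exchange partners
            for i in range(n):
                partner = i ^ j
                if partner > i:                    # handle each pair once
                    ascending = (i & k) == 0       # direction of this block
                    if (a[i] > a[partner]) == ascending:
                        a[i], a[partner] = a[partner], a[i]
            depth += 1         # all pairs in a stage are independent
            j //= 2
        k *= 2
    return a, depth


if __name__ == "__main__":
    result, stages = bitonic_sort([5, 3, 8, 1, 9, 2, 7, 4])
    print(result, stages)
```

For n = 2^m inputs the stage count is m(m + 1)/2, for example 6 stages for n = 8, which matches the Θ(lg² n) depth quoted in the abstract.
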
This paper reports the findings of a project undertaken at Thinking Machines Corporation to develop a fast sorting algorithm for the Connection Machine Supercomputer model CM-2. The primary goals of this project were:
