A comparison of sorting algorithms for the connection machine CM-2

作者: Guy E. Blelloch , Charles E. Leiserson , Bruce M. Maggs , C. Greg Plaxton , Stephen J. Smith

DOI: 10.1145/113379.113380

关键词:

摘要: Sorting is arguably the most studied problem in computer science, both because it used as a substep many applications and simple, combinatorial with interesting diverse solutions. also an important benchmark for parallel supercomputers. It requires significant communication bandwidth among processors, unlike other supercomputer benchmarks, efficient sorting algorithms communicate data irregular patterns. Parallel have been since at least 1960’s. An early advance came 1968 when Batcher discovered elegant U(lg2 n)-depth bitonic network [3]. For certain families of fixed interconnection networks, such hypercube shuffle-exchange, Batcher’s technique provides algorithm n numbers n) time processors. The question existence o(lg2 remained open until 1983, Ajtai, Komlos, Szemeredi [1] provided optimal U(lg network, but unfortunately, their construction leads to larger networks than those given by sort all “practical” values n. Leighton [15] has shown that any family can be bounded-degree domain. Not surprisingly, n)-time implied AKS are impractical. In Reif Valiant proposed more practical O(lg randomized [19], called flashsort. Many literature, including versions radix quicksort [5], variant hyperquicksort [23], smoothsort [18], column [15], Nassimi Sahni’s [17], merge [6]. This paper reports findings project undertaken Thinking Machines Corporation develop fast Connection Machine Supercomputer model CM-2. primary goals this were:

参考文章(33)
J.S. Huang, Y.C. Chow, Parallel sorting and data partitioning by sampling IEEE Computer Society Press,Silver Spring, MD. ,(1983)
Bruce Alan Wagar, Practical sorting algorithms for hypercube computers University of Michigan. ,(1991)
Lennart Johnsson, Combining Parallel and Sequential Sorting on a Boolean n–cube international conference on parallel processing. ,(1984)
Alok Aggarwal, Ming-Deh A. Huang, Network complexity of sorting and graph problems and simulating CRCW PRAMS by interconnection networks (preliminary version) AWOC '88 Proceedings of the 3rd Aegean Workshop on Computing: VLSI Algorithms and Architectures. pp. 339- 350 ,(1988) , 10.1007/BFB0040401
M. Ajtai, J. Komlós, E. Szemerédi, Sorting inc logn parallel steps Combinatorica. ,vol. 3, pp. 1- 19 ,(1983) , 10.1007/BF02579338
Baudet, Stevenson, Optimal Sorting Algorithms for Parallel Computers IEEE Transactions on Computers. ,vol. 27, pp. 84- 87 ,(1978) , 10.1109/TC.1978.1674957
Youngju Won, Sartaj Sahni, A balanced bin sort for hypercube multicomputers The Journal of Supercomputing. ,vol. 2, pp. 435- 448 ,(1988) , 10.1007/BF00156678
John H. Reif, Leslie G. Valiant, A logarithmic time sort for linear size networks Journal of the ACM. ,vol. 34, pp. 60- 76 ,(1987) , 10.1145/7531.7532
Richard Cole, Parallel merge sort SIAM Journal on Computing. ,vol. 17, pp. 770- 785 ,(1988) , 10.1137/0217049
Michael J. Quinn, Analysis and benchmarking of two parallel sorting algorithms: hyperquicksort and quickmerge Bit Numerical Mathematics. ,vol. 29, pp. 239- 250 ,(1989) , 10.1007/BF01952679