作者: Jeffrey Scott Vitter , Yossi Matias , Eran Segal
关键词: Bundle 、 Block size 、 Disk space 、 Efficient algorithm 、 Internal memory 、 Counting sort 、 Upper and lower bounds 、 Auxiliary memory 、 Combinatorics 、 Mathematics
摘要: Many data sets to be sorted consist of a limited number distinct keys. Sorting such can thought as bundling together identical keys and having the bundles placed in order; we therefore denote this bundle sorting. We describe an efficient algorithm for sorting external memory that requires at most c(N/B)logM/Bk disk accesses, where N is keys, M size internal memory, k B transfer block size, 2