B-tries for disk-based string management

作者: Nikolas Askitis , Justin Zobel

DOI: 10.1007/S00778-008-0094-1

关键词: B-treeAlgorithmRange (mathematics)Data structureComputer scienceAuxiliary memoryString (computer science)Suffix treeTask (computing)

摘要: A wide range of applications require that large quantities data be maintained in sort order on disk. The B-tree, and its variants, are an efficient general-purpose disk-based structure is almost universally used for this task. B-trie has the potential to a competitive alternative storage where strings as keys, but not previously been thoroughly described or tested. We propose new algorithms insertion, deletion, equality search variable-length disk-resident B-trie, well novel splitting strategies which critical element practical implementation. experimentally compare against variants B-tree several sets with characteristics. Our results demonstrate that, although uses more memory, it faster, scalable, requires less disk space.

参考文章(104)
Greg Gagne, Peter Baer Galvin, Abraham Silberschatz, Operating System Concepts 7th Edition with Java 7th Edition John Wiley & Sons. ,(2006)
Ricardo A. Baeza-Yates, An adaptive overflow technique for B-trees (extended abstract) extending database technology. pp. 16- 28 ,(1990)
Donald E. Knuth, The art of computer programming, volume 3: (2nd ed.) sorting and searching Addison Wesley Longman Publishing Co., Inc.. ,(1998)
Robert Sedgewick, Michael Schidlowsky, Algorithms in Java, Third Edition, Parts 1-4: Fundamentals, Data Structures, Sorting, Searching Addison-Wesley Longman Publishing Co., Inc.. ,(1998)
Murray Sherk, Self-Adjusting k-ary Search Trees workshop on algorithms and data structures. pp. 381- 392 ,(1989) , 10.1007/3-540-51542-9_32
Nikolas Askitis, Justin Zobel, Cache-Conscious Collision Resolution in String Hash Tables String Processing and Information Retrieval. pp. 91- 102 ,(2005) , 10.1007/11575832_11
Richard E. Ladner, Ray Fortna, Bao-Hoang Nguyen, A comparison of cache aware and cache oblivious static search trees using program instrumentation Lecture Notes in Computer Science. pp. 78- 92 ,(2002) , 10.1007/3-540-36383-1_4
Juha Kärkkäinen, S. Srinivasa Rao, Full-Text Indexes in External Memory Algorithms for Memory Hierarchies. pp. 149- 170 ,(2003) , 10.1007/3-540-36574-5_7
Pang Ko, Srinivas Aluru, Optimal Self-adjusting Trees for Dynamic String Data in Secondary Storage String Processing and Information Retrieval. pp. 184- 194 ,(2007) , 10.1007/978-3-540-75530-2_17