On the height of digital trees and related problems

作者: Wojciech Szpankowski

DOI: 10.1007/BF01759045

关键词: Statistical modelMathematicsMarkov chainTrieSuffixSuffix treeDiscrete mathematicsCombinatoricsPrefixWord (computer architecture)String (computer science)

摘要: This paper studies in a probabilistic framework some topics concerning the way words (strings) can overlap, and relationship of this to height digital trees associated with set words. A word is defined as random sequence (possibly infinite) symbols over finite alphabet. key notion analignment matrix {Cij}i,j=1n introduced whereCij length longest string that prefix theith thejth word. It proved an tree simply related alignment through order statistics. In particular, using observation proving inequalities for statistics, we establish adigital trie under anindependent model (i.e., all are statistically independent) asymptotically equal 2 logźn wheren number stored ź parameter model. result generalized three directions, namely considerb-tries,Markovian dependency among letters word), adependent words). when consecutive Markov dependent (Markovian model), then demonstrate converges probability where underlying chain. On other hand, suffix which fall into model, show does not exceed logźn, These results find plenty applications analysis data structures built

参考文章(25)
Alberto Apostolico, The Myriad Virtues of Subword Trees Springer, Berlin, Heidelberg. pp. 85- 96 ,(1985) , 10.1007/978-3-642-82456-2_6
Philippe Jacquet, Mireille Regnier, Trie partitioning process: limiting distributions colloquium on trees in algebra and programming. pp. 196- 210 ,(1986) , 10.1007/BFB0022669
Aho AV, JE Hopcroft, JD Ullman, The Design and Analysis of Computer Algorithms ,(1974)
Philippe Flajolet, On the performance evaluation of extendible hashing and trie searching Acta Informatica. ,vol. 20, pp. 345- 369 ,(1983) , 10.1007/BF00264279
Luc Devroye, Wojciech Szpankowski, Bonita Rais, A note on the height of suffix trees SIAM Journal on Computing. ,vol. 21, pp. 48- 53 ,(1992) , 10.1137/0221005
Samuel Karlin, Friedemann Ost, Counts of long aligned word matches among random letter sequences Advances in Applied Probability. ,vol. 19, pp. 293- 351 ,(1987) , 10.2307/1427422
Boris Pittel, Paths in a random digital tree: limiting distributions Advances in Applied Probability. ,vol. 18, pp. 139- 155 ,(1986) , 10.2307/1427240