Limit Laws for Heights in Generalized Tries and PATRICIA Tries

作者: Wojciech Szpankowski , Charles Knessl

DOI:

关键词: Extreme value theoryDouble exponential functionMellin transformMathematicsDiscrete mathematicsExponential functionAsymptotic distributionLimit of a functionSaddle pointDistribution (mathematics)Mathematical physics

摘要: Wojciech Szpankowskit Department of Computer Science Purdue University W. LafayeUe, IN 47907 UB.A. spa@cs.purdue.edu We consider digital trees such as (generalized) tries and PATRICIA tries, built from n random strings generated by an unbiased memoryless source (i.e., all symbols are equally likely). study limit laws the height which is defined longest path in trees. It turns out that this also represents number questions required to recognize distinct objects. shall identiry three natural regions distribution~. For region where most probability mas~ concentrated, asymptotic distribution extreme value type (Le., double exponential distribution). Surprisinglyenough, PATIUCIA trie behaves quite differently region; exhibits a Gaussian (with oscillating tcnn) around probable k l = llog2 + /21og2 ~ ~J+l. In fact, concentrates on one or two points. ma."iS concentrated at kI , however, there exist subsequences mass points hI 1 hI, hi 1. derive these results combination analytic methods generating functions, Mellin transform, saddle point method ideas applied mathematics linearization, matching WKB method. present some numerical verification our results.

参考文章(28)
Philippe Jacquet, Mireille Regnier, Trie partitioning process: limiting distributions colloquium on trees in algebra and programming. pp. 196- 210 ,(1986) , 10.1007/BFB0022669
Norman Bleistein, Richard A. Handelsman, Asymptotic Expansions of Integrals ,(1975)
Hosam M. Mahmoud, Evolution of random search trees ,(1991)
Steven A. Orszag, Carl M. Bender, Advanced mathematical methods for scientists and engineers ,(1978)
Philippe Flajolet, On the performance evaluation of extendible hashing and trie searching Acta Informatica. ,vol. 20, pp. 345- 369 ,(1983) , 10.1007/BF00264279
Wojciech Szpankowski, Patricia tries again revisited Journal of the ACM. ,vol. 37, pp. 691- 711 ,(1990) , 10.1145/96559.214080
Wojciech Szpankowski, On the height of digital trees and related problems Algorithmica. ,vol. 6, pp. 256- 277 ,(1991) , 10.1007/BF01759045
Boris Pittel, Paths in a random digital tree: limiting distributions Advances in Applied Probability. ,vol. 18, pp. 139- 155 ,(1986) , 10.2307/1427240
B Pittel, H Rubin, How many random questions are necessary to identify n distinct objects Journal of Combinatorial Theory, Series A. ,vol. 55, pp. 292- 312 ,(1990) , 10.1016/0097-3165(90)90072-5