Bounds for Frequency Estimation of Packet Streams.

作者: Prosenjit Bose , Pat Morin , Yihui Tang , Evangelos Kranakis

DOI:

关键词:

摘要: We consider the problem of approximating frequency frequently occurring elements in a stream length n using only memory size m n. This models process gathering statistics on Internet packet streaming that is small relative to number classes (e.g. IP addresses) packets. show when some data item occurs αn times n, FREQUENT algorithm Demaine et al. [4], can approximate a’s with an error no more than (1−α)n/m. also give lower-bound (1−α)n/(m+1) accuracy any deterministic counting algorithm, which implies nearly optimal. Finally, we randomized algorithms not be significantly accurate since there lower bound (1−α)Ω(n/m) expected algorithm.

参考文章(13)
Moses Charikar, Kevin Chen, Martin Farach-Colton, Finding Frequent Items in Data Streams international colloquium on automata languages and programming. ,vol. 312, pp. 693- 703 ,(2002) , 10.1016/S0304-3975(03)00400-6
Lynne Stokes, Jeffrey F. Naughton, Peter J. Haas, S. Seshadri, Sampling-Based Estimation of the Number of Distinct Values of an Attribute very large data bases. pp. 311- 322 ,(1995)
Erik D. Demaine, Alejandro López-Ortiz, J. Ian Munro, Frequency Estimation of Internet Packet Streams with Limited Space european symposium on algorithms. pp. 348- 360 ,(2002) , 10.1007/3-540-45749-6_33
Cristian Estan, George Varghese, New directions in traffic measurement and accounting Proceedings of the First ACM SIGCOMM Workshop on Internet Measurement - IMW '01. ,vol. 32, pp. 323- 336 ,(2001) , 10.1145/505202.505212
Hector Garcia-Molina, Rajeev Motwani, Narayanan Shivakumar, Jeffrey D. Ullman, Min Fang, Computing Iceberg Queries Efficiently very large data bases. pp. 299- 310 ,(1998)
Mayur Datar, Aristides Gionis, Piotr Indyk, Rajeev Motwani, Maintaining Stream Statistics over Sliding Windows SIAM Journal on Computing. ,vol. 31, pp. 1794- 1813 ,(2002) , 10.1137/S0097539701398363
Andrew Chi-Chin Yao, Probabilistic computations: Toward a unified measure of complexity 18th Annual Symposium on Foundations of Computer Science (sfcs 1977). pp. 222- 227 ,(1977) , 10.1109/SFCS.1977.24
Pankaj Gupta, Nick McKeown, Packet classification on multiple fields acm special interest group on data communication. ,vol. 29, pp. 147- 160 ,(1999) , 10.1145/316188.316217
Gurmeet Singh Manku, Rajeev Motwani, Approximate frequency counts over data streams Proceedings of the VLDB Endowment. ,vol. 5, pp. 1699- 1699 ,(2012) , 10.14778/2367502.2367508
Noga Alon, Yossi Matias, Mario Szegedy, The Space Complexity of Approximating the Frequency Moments Journal of Computer and System Sciences. ,vol. 58, pp. 137- 147 ,(1999) , 10.1006/JCSS.1997.1545