Estimating Top-k Destinations in Data Streams

作者: Nuno Homem , Joao Paulo Carvalho

DOI: 10.1007/978-3-642-14049-5_30

关键词:

摘要: One considers the problem of estimating most frequent values in a data stream. In many cases an approximate answer may be enough. A novel algorithm is presented to using mixed approach between counter-based techniques and sketch-based ones. The then used find destinations calls by individual customers telecommunications operators. use fast small footprint algorithms critical due huge number check answers are enough situations. that such detection needs performed for each customer kept up date at all times. This paper presents customer's behavior justify algorithms. Although this on well other contexts.

参考文章(15)
Gurmeet Singh Manku, Rajeev Motwani, Chapter 31 – Approximate Frequency Counts over Data Streams very large data bases. pp. 346- 357 ,(2002) , 10.1016/B978-155860869-6/50038-X
Ahmed Metwally, Divyakant Agrawal, Amr El Abbadi, Efficient Computation of Frequent and Top-k Elements in Data Streams Database Theory - ICDT 2005. pp. 398- 412 ,(2004) , 10.1007/978-3-540-30570-5_27
Erik D. Demaine, Alejandro López-Ortiz, J. Ian Munro, Frequency Estimation of Internet Packet Streams with Limited Space european symposium on algorithms. pp. 348- 360 ,(2002) , 10.1007/3-540-45749-6_33
Cristian Estan, George Varghese, New directions in traffic measurement and accounting Proceedings of the First ACM SIGCOMM Workshop on Internet Measurement - IMW '01. ,vol. 32, pp. 323- 336 ,(2001) , 10.1145/505202.505212
Mayur Datar, Aristides Gionis, Piotr Indyk, Rajeev Motwani, Maintaining Stream Statistics over Sliding Windows SIAM Journal on Computing. ,vol. 31, pp. 1794- 1813 ,(2002) , 10.1137/S0097539701398363
J. Misra, David Gries, Finding Repeated Elements Science of Computer Programming. ,vol. 2, pp. 143- 152 ,(1982) , 10.1016/0167-6423(82)90012-0
Kyu-Young Whang, Brad T. Vander-Zanden, Howard M. Taylor, A linear-time probabilistic counting algorithm for database applications ACM Transactions on Database Systems. ,vol. 15, pp. 208- 229 ,(1990) , 10.1145/78922.78925
Cristian Estan, George Varghese, New directions in traffic measurement and accounting ACM Transactions on Computer Systems. ,vol. 21, pp. 270- 313 ,(2003) , 10.1145/859716.859719
Philippe Flajolet, G. Nigel Martin, Probabilistic counting algorithms for data base applications Journal of Computer and System Sciences. ,vol. 31, pp. 182- 209 ,(1985) , 10.1016/0022-0000(85)90041-8
Cristian Estan, George Varghese, Mike Fisk, Bitmap algorithms for counting active flows on high speed links internet measurement conference. pp. 153- 166 ,(2003) , 10.1145/948205.948225