Diamond in the rough: finding Hierarchical Heavy Hitters in multi-dimensional data

作者: Graham Cormode , Flip Korn , S. Muthukrishnan , Divesh Srivastava

DOI: 10.1145/1007568.1007588

关键词:

摘要: … For data stream applications, we present online algorithms that find approximate HHHs in one pass, with provable accuracy guarantees. We show experimentally, using real and …

参考文章(13)
Laks V.S. Lakshmanan, Raymond T. Ng, Christine Xing Wang, Xiaodong Zhou, Theodore J. Johnson, The generalized MDL approach for summarization very large data bases. pp. 766- 777 ,(2002) , 10.1016/B978-155860869-6/50073-1
Jeffrey Scott Vitter, Min Wang, Bala Iyer, Data cube approximation and histograms via wavelets conference on information and knowledge management. pp. 96- 104 ,(1998) , 10.1145/288627.288645
Nitin Thaper, Sudipto Guha, Piotr Indyk, Nick Koudas, Dynamic multidimensional histograms Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02. pp. 428- 439 ,(2002) , 10.1145/564691.564741
Gurmeet Singh Manku, Rajeev Motwani, Approximate frequency counts over data streams Proceedings of the VLDB Endowment. ,vol. 5, pp. 1699- 1699 ,(2012) , 10.14778/2367502.2367508
Kevin Beyer, Raghu Ramakrishnan, Bottom-up computation of sparse and Iceberg CUBE ACM SIGMOD Record. ,vol. 28, pp. 359- 370 ,(1999) , 10.1145/304181.304214
Cristian Estan, Stefan Savage, George Varghese, Automatically inferring patterns of resource consumption in network traffic Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '03. pp. 137- 148 ,(2003) , 10.1145/863955.863972
Ju-Hong Lee, Deok-Hwan Kim, Chin-Wan Chung, Multi-dimensional selectivity estimation using compressed histogram information ACM SIGMOD Record. ,vol. 28, pp. 205- 214 ,(1999) , 10.1145/304181.304200
Sudipto Guha, Nick Koudas, Kyuseok Shim, Data-streams and histograms Proceedings of the thirty-third annual ACM symposium on Theory of computing - STOC '01. pp. 471- 475 ,(2001) , 10.1145/380752.380841
Graham Cormode, Flip Korn, S. Muthukrishnan, Divesh Srivastava, Finding hierarchical heavy hitters in data streams very large data bases. pp. 464- 475 ,(2003) , 10.1016/B978-012722442-8/50048-3