Approximating Large Frequency Moments with O(n 1-2/k ) Bits.

作者: Vladimir Braverman , Gregory Vorsanger , Charles Seidell , Jonathan Katzman

DOI:

关键词:

摘要: In this paper we consider the problem of approximating frequency moments in streaming model. Given a stream $D = \{p_1,p_2,\dots,p_m\}$ numbers from $\{1,\dots, n\}$, $i$ is defined as $f_i |\{j: p_j i\}|$. The $k$-th \emph{frequency moment} $D$ $F_k \sum_{i=1}^n f_i^k$. In give an upper bound on space required to find moment $O(n^{1-2/k})$ bits that matches, up constant factor, lower Woodruff and Zhang (STOC 12) for $\epsilon$ $k$. Our algorithm makes single pass over works any $k > 3$.

参考文章(40)
David P. Woodruff, Morteza Monemizadeh, 1-pass relative-error Lp-sampling with applications symposium on discrete algorithms. pp. 1143- 1160 ,(2010) , 10.5555/1873601.1873693
Piotr Indyk, David Woodruff, Optimal approximations of the frequency moments of data streams Proceedings of the thirty-seventh annual ACM symposium on Theory of computing - STOC '05. pp. 202- 208 ,(2005) , 10.1145/1060590.1060621
Noga Alon, Yossi Matias, Mario Szegedy, The Space Complexity of Approximating the Frequency Moments Journal of Computer and System Sciences. ,vol. 58, pp. 137- 147 ,(1999) , 10.1006/JCSS.1997.1545
Daniel M. Kane, Jelani Nelson, David P. Woodruff, An optimal algorithm for the distinct elements problem Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems of data - PODS '10. pp. 41- 52 ,(2010) , 10.1145/1807085.1807094
Ziv Bar-Yossef, T.S. Jayram, Ravi Kumar, D. Sivakumar, An information statistics approach to data stream and communication complexity foundations of computer science. ,vol. 68, pp. 209- 218 ,(2002) , 10.1016/J.JCSS.2003.11.006
Sumit Ganguly, Estimating Frequency Moments of Data Streams Using Random Linear Combinations Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques. pp. 369- 380 ,(2004) , 10.1007/978-3-540-27821-4_33
Paul Beame, T. S. Jayram, Atri Rudra, Lower bounds for randomized read/write stream algorithms Proceedings of the thirty-ninth annual ACM symposium on Theory of computing - STOC '07. pp. 689- 698 ,(2007) , 10.1145/1250790.1250891
Jeffrey S. Vitter, Random sampling with a reservoir ACM Transactions on Mathematical Software. ,vol. 11, pp. 37- 57 ,(1985) , 10.1145/3147.3165
Stefan Dziembowski, Krzysztof Pietrzak, Smooth Histograms for Sliding Windows foundations of computer science. pp. 283- 293 ,(2007) , 10.1109/FOCS.2007.63
T. S. Jayram, David P. Woodruff, Optimal Bounds for Johnson-Lindenstrauss Transforms and Streaming Problems with Subconstant Error ACM Transactions on Algorithms. ,vol. 9, pp. 1- 17 ,(2013) , 10.1145/2483699.2483706