Schatten Norms in Matrix Streams: Hello Sparsity, Goodbye Dimension

作者: Vladimir Braverman , Robert Krauthgamer , Aditya Krishnan , Roi Sinoff

DOI:

关键词:

摘要: Spectral functions of large matrices contains important structural information about the underlying data, and is thus becoming increasingly important. Many times, representing real-world data are \emph{sparse} or \emph{doubly sparse} (i.e., sparse in both rows columns), accessed as a \emph{stream} updates, typically organized \emph{row-order}. In this setting, where space (memory) limiting resource, all known algorithms require that polynomial dimension matrix, even for matrices. We address challenge by providing first whose requirement \emph{independent matrix dimension}, assuming doubly-sparse presented row-order. Our approximate Schatten $p$-norms, which we use turn to other spectral functions, such logarithm determinant, trace inverse, Estrada index. validate these theoretical performance bounds numerical experiments on social networks. further prove multiple passes unavoidable show extensions our primary technique, including trade-off between requirements number passes.

参考文章(40)
Lingfei Wu, Jesse Laeuchli, Vassilis Kalantzis, Andreas Stathopoulos, Efstratios Gallopoulos, Estimating the trace of the matrix inverse by interpolating from the diagonal of an approximate inverse Journal of Computational Physics. ,vol. 326, pp. 828- 844 ,(2016) , 10.1016/J.JCP.2016.09.001
Edoardo Di Napoli, Eric Polizzi, Yousef Saad, Efficient estimation of eigenvalue counts in an interval Numerical Linear Algebra With Applications. ,vol. 23, pp. 674- 692 ,(2016) , 10.1002/NLA.2048
David P. Woodruff, Huy L. Nguyen, Yi Li, On sketching matrix norms and the top singular vector symposium on discrete algorithms. pp. 1562- 1581 ,(2014) , 10.5555/2634074.2634188
Moses Charikar, Kevin Chen, Martin Farach-Colton, Finding Frequent Items in Data Streams international colloquium on automata languages and programming. ,vol. 312, pp. 693- 703 ,(2002) , 10.1016/S0304-3975(03)00400-6
T. S. Jayram, Hellinger Strikes Back: A Note on the Multi-party Information Complexity of AND Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques. ,vol. 5687, pp. 562- 573 ,(2009) , 10.1007/978-3-642-03685-9_42
Marc Bury, Chris Schwiegelshohn, Sublinear Estimation of Weighted Matchings in Dynamic Data Streams european symposium on algorithms. pp. 263- 274 ,(2015) , 10.1007/978-3-662-48350-3_23
Jinwoo Shin, Insu Han, Dmitry Malioutov, Large-scale log-determinant computation through stochastic Chebyshev expansions international conference on machine learning. pp. 908- 917 ,(2015)
Warren Schudy, Maxim Sviridenko, Concentration and moment inequalities for polynomials of independent random variables symposium on discrete algorithms. pp. 437- 446 ,(2012) , 10.5555/2095116.2095153
Ryan O'Donnell, Analysis of Boolean Functions ,(2014)
Yuchen Zhang, Martin Wainwright, Michael Jordan, None, Distributed Estimation of Generalized Matrix Rank: Efficient Algorithms and Lower Bounds international conference on machine learning. pp. 457- 465 ,(2015)