Fast Monte Carlo Algorithms for Matrices I: Approximating Matrix Multiplication

作者: Petros Drineas , Ravi Kannan , Michael W. Mahoney

DOI: 10.1137/S0097539704442684

关键词: Order (ring theory)Matrix normCombinatoricsMathematicsSingular value decompositionRandomized algorithmMatrix (mathematics)AlgorithmSparse approximationMatrix multiplicationSparse matrix

摘要: Motivated by applications in which the data may be formulated as a matrix, we consider algorithms for several common linear algebra problems. These make more efficient use of computational resources, such computation time, random access memory (RAM), and number passes over data, than do previously known these In this paper, devise two matrix multiplication problem. Suppose $A$ $B$ (which are $m\times n$ $n\times p$, respectively) input matrices. our main algorithm, perform $c$ independent trials, where each trial randomly sample an element $\{ 1,2,\ldots, n\}$ with appropriate probability distribution ${\cal P}$ on n\}$. We form c$ $C$ consisting sampled columns $A$, scaled appropriately, $c\times $R$ using corresponding rows $B$, again appropriately. The choice column row scaling crucial features algorithm. When chosen judiciously, show that $CR$ is good approximation to $AB$. More precisely, $$ \left\|AB-CR\right\|_F = O(\left\|A\right\|_F \left\|B\right\|_F /\sqrt c) , $\|\cdot\|_F$ denotes Frobenius norm, i.e., $\|A\|^2_F=\sum_{i,j}A_{ij}^2$. This algorithm can implemented without storing matrices RAM, provided it stored external $O(c(m+n+p))$ additional RAM construct $R$. then present second similar spirit addition, model (the pass-efficient model) efficiency other approximate studied argue well suited many involving massive sets. model, scarce resources space time required presented any order entries (and not just or order), case where, e.g., has been written multiple agents. sparse representation, only nonzero written.

参考文章(42)
Christos H. Papadimitriou, Ziv Bar-Yossef, The complexity of massive data set computations University of California at Berkeley. ,(2002)
R Vershynin, Coordinate restrictions of linear operators in $l_2^n$ arXiv: Functional Analysis. ,(2017)
Ji-guang Sun, G. W. Stewart, Matrix perturbation theory ,(1990)
S. Vempala, Random projection: a new approach to VLSI layout foundations of computer science. pp. 389- 395 ,(1998) , 10.1109/SFCS.1998.743489
P. Drineas, R. Kannan, Fast Monte-Carlo algorithms for approximate matrix multiplication international conference on cluster computing. pp. 452- 459 ,(2001) , 10.1109/SFCS.2001.959921
Sridhar Rajagopalan, Prabhakar Raghavan, Monika R. Henzinger, Computing on data streams External memory algorithms. pp. 107- 118 ,(1999)
A. Frieze, R. Kannan, S. Vempala, Fast Monte-Carlo algorithms for finding low-rank approximations foundations of computer science. pp. 370- 378 ,(1998) , 10.1109/SFCS.1998.743487
Petros Drineas, Ravi Kannan, Michael W. Mahoney, Fast Monte Carlo Algorithms for Matrices II: Computing a Low-Rank Approximation to a Matrix SIAM Journal on Computing. ,vol. 36, pp. 158- 183 ,(2006) , 10.1137/S0097539704442696
Dimitris Achlioptas, Frank Mcsherry, Fast computation of low-rank matrix approximations Journal of the ACM. ,vol. 54, pp. 9- ,(2007) , 10.1145/1219092.1219097
Jon M. Kleinberg, Two algorithms for nearest-neighbor search in high dimensions symposium on the theory of computing. pp. 599- 608 ,(1997) , 10.1145/258533.258653