Online Learning for Matrix Factorization and Sparse Coding

作者: Julien Mairal , Guillermo Sapiro , Francis Bach , Jean Ponce

DOI:

关键词: Sparse PCAMatrix decompositionNon-negative matrix factorizationStochastic optimizationK-SVDSparse approximationOnline machine learningBasis pursuitComputer scienceTheoretical computer science

摘要: Sparse coding--that is, modelling data vectors as sparse linear combinations of basis elements--is widely used in machine learning, neuroscience, signal processing, and statistics. This paper focuses on the large-scale matrix factorization problem that consists learning set order to adapt it specific data. Variations this include dictionary non-negative principal component analysis. In paper, we propose address these tasks with a new online optimization algorithm, based stochastic approximations, which scales up gracefully large sets millions training samples, extends naturally various formulations, making suitable for wide range problems. A proof convergence is presented, along experiments natural images genomic demonstrating leads state-of-the-art performance terms speed both small sets.

参考文章(83)
Shai Shalev-Shwartz, Nathan Srebro, Ohad Shamir, Karthik Sridharan, Stochastic Convex Optimization. conference on learning theory. ,(2009)
Jonathan M. Borwein, Adrian S Lewis, Convex analysis and nonlinear optimization : theory and examples Springer. ,(2000)
Michael Zibulevsky, Barak A. Pearlmutter, Pavel Kisilev, Pau Bofill, Blind source separation by sparse decomposition Neural Computation. ,(2001)
K. Engan, S.O. Aase, J.H. Husoy, Frame based signal compression using method of optimal directions (MOD) international symposium on circuits and systems. ,vol. 4, pp. 1- 4 ,(1999) , 10.1109/ISCAS.1999.779928
Tomaso A. Poggio, Kah Kay Sung, Learning and example selection for object and pattern detection Massachusetts Institute of Technology. ,(1996)
Pierre Priouret, Michel Métivier, Albert Benveniste, Adaptive Algorithms and Stochastic Approximations ,(1990)
Rodolphe Jenatton, Guillaume Obozinski, Francis R. Bach, Structured Sparse Principal Component Analysis international conference on artificial intelligence and statistics. pp. 366- 373 ,(2009)
J Frédéric Bonnans, Alexander Shapiro, Perturbation Analysis of Optimization Problems ,(2000)
Laurent Jacob, Guillaume Obozinski, Jean-Philippe Vert, Group lasso with overlap and graph lasso Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 433- 440 ,(2009) , 10.1145/1553374.1553431