Fully distributed EM for very large datasets

作者: Jason Wolfe , Aria Haghighi , Dan Klein

DOI: 10.1145/1390156.1390305

关键词:

摘要: In EM and related algorithms, E-step computations distribute easily, because data items are … We present a framework that fully distributes the entire EM procedure. Each node interacts …

参考文章(10)
Mark A. Paskin, Carlos E. Guestrin, Robust probabilistic inference in distributed systems uncertainty in artificial intelligence. pp. 436- 445 ,(2004) , 10.5555/1036843.1036896
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Vincent J. Della Pietra, Stephen A. Della Pietra, Robert L. Mercer, Peter F. Brown, The mathematics of statistical machine translation: parameter estimation Computational Linguistics. ,vol. 19, pp. 263- 311 ,(1993)
A. P. Dempster, N. M. Laird, D. B. Rubin, Maximum Likelihood from Incomplete Data Via theEMAlgorithm Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 39, pp. 1- 22 ,(1977) , 10.1111/J.2517-6161.1977.TB01600.X
Cheng-tao Chu, Sang Kim, Yi-An Lin, YuanYuan Yu, Gary Bradski, Kunle Olukotun, Andrew Ng, None, Map-Reduce for Machine Learning on Multicore neural information processing systems. ,vol. 19, pp. 281- 288 ,(2006)
Yiming Yang, Fan Li, David D. Lewis, Tony G. Rose, RCV1: A New Benchmark Collection for Text Categorization Research Journal of Machine Learning Research. ,vol. 5, pp. 361- 397 ,(2004) , 10.5555/1005332.1005345
Padhraic Smyth, David Newman, Max Welling, Arthur U. Asuncion, Distributed Inference for Latent Dirichlet Allocation neural information processing systems. ,vol. 20, pp. 1081- 1088 ,(2007)
R.D. Nowak, Distributed EM algorithms for density estimation and clustering in sensor networks IEEE Transactions on Signal Processing. ,vol. 51, pp. 2245- 2253 ,(2003) , 10.1109/TSP.2003.814623
Jeffrey Dean, Sanjay Ghemawat, MapReduce Communications of the ACM. ,vol. 51, pp. 107- 113 ,(2008) , 10.1145/1327452.1327492