Alpha-Beta Divergences Discover Micro and Macro Structures in Data

作者: Pieter Abbeel , Ali Punjani , Karthik Narayan

DOI:

关键词:

摘要: Although recent work in non-linear dimensionality reduction investigates multiple choices of divergence measure during optimization (Yang et al., 2013; Bunte 2012), little discusses the direct effects that measures have on visualization. We study this relationship, theoretically and through an empirical analysis over 10 datasets. Our works shows how α β parameters generalized alpha-beta can be chosen to discover hidden macrostructures (categories, e.g. birds) or microstructures (fine-grained classes, toucans). method, which generalizes t-SNE (van der Maaten, 2008), allows us such structure without extensive grid searches (α, β) due our theoretical analysis: is apparent with particular generalize across also discuss efficient parallel CPU GPU schemes are non-trivial tree-structures employed large datasets do not fully fit into memory. method runs 20x faster than fastest published code (Vladymyrov & Carreira-Perpinan, 2014). conclude detailed case studies following very datasets: ILSVRC 2012, a standard computer vision dataset 1.2M images; SUSY, particle physics 5M instances; HIGGS, another 11M instances. This represents largest visualization attained by SNE methods. open-sourced code: http://rll.berkeley.edu/absne/.

参考文章(29)
Miguel Á. Carreira-Perpiñan, The elastic embedding algorithm for dimensionality reduction international conference on machine learning. pp. 167- 174 ,(2010)
Nathan Bell, Jared Hoberock, Thrust: A Productivity-Oriented Library for CUDA Programming Massively Parallel Processors (Third Edition)#R##N#A Hands-on Approach. pp. 359- 371 ,(2012) , 10.1016/B978-0-12-385963-1.00026-5
Christopher Rogan, Kinematical variables towards new dynamics at the LHC arXiv: High Energy Physics - Phenomenology. ,(2011)
Hsin-Chia Cheng, Zhenyu Han, Minimal kinematic constraints and m T2 Journal of High Energy Physics. ,vol. 2008, pp. 063- 063 ,(2008) , 10.1088/1126-6708/2008/12/063
Roland Memisevic, Geoffrey Hinton, 2005 Special Issue: Improving dimensionality reduction with spectral gradient descent Neural Networks. ,vol. 18, pp. 702- 710 ,(2005) , 10.1016/J.NEUNET.2005.06.034
Matthew R. Buckley, Joseph D. Lykken, Christopher Rogan, Maria Spiropulu, Super-razor and searches for sleptons and charginos at the LHC Physical Review D. ,vol. 89, pp. 055020- ,(2014) , 10.1103/PHYSREVD.89.055020
Andrzej Cichocki, Sergio Cruces, Shun-ichi Amari, Generalized Alpha-Beta Divergences and Their Application to Robust Nonnegative Matrix Factorization Entropy. ,vol. 13, pp. 134- 170 ,(2011) , 10.3390/E13010134
Peter N. Yianilos, Data structures and algorithms for nearest neighbor search in general metric spaces symposium on discrete algorithms. pp. 311- 321 ,(1993) , 10.5555/313559.313789
Samuel Kaski, Samuel Kaski, Jaakko Peltonen, Jaakko Peltonen, Zhirong Yang, Optimization Equivalence of Divergences Improves Neighbor Embedding international conference on machine learning. pp. 460- 468 ,(2014)
F.S. Samaria, A.C. Harter, Parameterisation of a stochastic model for human face identification workshop on applications of computer vision. pp. 138- 142 ,(1994) , 10.1109/ACV.1994.341300