Nonlinear Spike-And-Slab Sparse Coding for Interpretable Image Encoding

作者: Jacquelyn A. Shelton , Abdul-Saboor Sheikh , Jörg Bornschein , Philip Sterne , Jörg Lücke

DOI: 10.1371/JOURNAL.PONE.0124088

关键词: Laplace transformPixelPrior probabilityProbabilistic logicComputer scienceGibbs samplingAlgorithmCauchy distributionNeural codingNonlinear systemGeneral Biochemistry, Genetics and Molecular BiologyGeneral Agricultural and Biological SciencesGeneral Medicine

摘要: Sparse coding is a popular approach to model natural images but has faced two main challenges: modelling low-level image components (such as edge-like structures and their occlusions) varying pixel intensities. Traditionally, are modelled sparse linear superposition of dictionary elements, where the probabilistic view this problem that coefficients follow Laplace or Cauchy prior distribution. We propose novel instead uses spike-and-slab nonlinear combination components. With prior, our can easily represent exact zeros for e.g. absence an component, such edge, distribution over non-zero nonlinearity (the max rule), idea target occlusions; elements correspond occlude each other. There major consequences assumptions made by both (non)linear approaches, thus goal paper isolate highlight differences between them. Parameter optimization analytically computationally intractable in model, contribution we design Gibbs sampler efficient inference which apply higher dimensional data using latent variable preselection. Results on artificial occlusion-rich with controlled forms structure show extract set closely match generating process, refer interpretable Furthermore, sparseness solution follows ground-truth number components/edges images. The did not learn any level sparsity. This suggests adaptively well-approximate characterize meaningful generation process.

参考文章(56)
Sam T. Roweis, Factorial models and refiltering for speech separation and denoising. conference of the international speech communication association. ,(2003)
Jörg Bornschein, Zhenwhen Dai, Approximate EM Learning on Large Computer Clusters ,(2010)
Miloš Hauskrecht, Tomáš Šingliar, Noisy-OR Component Analysis and its Application to Link Analysis Journal of Machine Learning Research. ,vol. 7, pp. 2189- 2213 ,(2006)
Harri Valpola, Erkki Oja, Alexander Ilin, Aritti Honkela, Juha Karhunen, Nonlinear Blind Source Separation by Variational Bayesian Learning IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences. ,vol. 86, pp. 532- 541 ,(2003)
Marc Henniges, Gervasio Puertas, Jörg Bornschein, Julian Eggert, Jörg Lücke, Binary sparse coding international conference on latent variable analysis and signal separation. pp. 450- 457 ,(2010) , 10.1007/978-3-642-15995-4_56
Ian J Goodfellow, Aaron Courville, Yoshua Bengio, None, Spike-and-Slab Sparse Coding for Unsupervised Feature Discovery arXiv: Machine Learning. ,(2012)