Gaussian mixture model with precision matrices approximated by sparsely represented eigenvectors

作者: Niksa M. Jakovljevic

DOI: 10.1109/TELFOR.2014.7034441

关键词:

摘要: This paper proposes a model which approximates full covariance matrices in Gaussian mixture models (GMM) with reduced number of parameters and computations required for likelihood evaluations. In the proposed inverse (precision) are approximated using sparsely represented eigenvectors, i.e. each eigenvector covariance/precision matrix is as linear combination small vectors from an overcomplete dictionary. A maximum algorithm parameter estimation its practical implementation presented. Experimental results on speech recognition task show that while keeping word error rate close to one obtained by GMMs matrices, can reduce 45%.

参考文章(13)
N.K. Goel, R.A. Gopinath, Multiple linear transforms international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 481- 484 ,(2001) , 10.1109/ICASSP.2001.940872
Julien Mairal, Francis Bach, Jean Ponce, Guillermo Sapiro, Online dictionary learning for sparse coding Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 689- 696 ,(2009) , 10.1145/1553374.1553463
Ilse C. F. Ipsen, Rizwana Rehman, Perturbation Bounds for Determinants and Characteristic Polynomials SIAM Journal on Matrix Analysis and Applications. ,vol. 30, pp. 762- 776 ,(2008) , 10.1137/070704770
J.A. Bilmes, Factored sparse inverse covariance matrices international conference on acoustics, speech, and signal processing. ,vol. 2, pp. 1009- 1012 ,(2000) , 10.1109/ICASSP.2000.859133
M.J.F. Gales, Semi-tied covariance matrices for hidden Markov models IEEE Transactions on Speech and Audio Processing. ,vol. 7, pp. 272- 281 ,(1999) , 10.1109/89.759034
Julien Mairal, Guillermo Sapiro, Francis Bach, Jean Ponce, Online Learning for Matrix Factorization and Sparse Coding Journal of Machine Learning Research. ,vol. 11, pp. 19- 60 ,(2010)
Pierre L. Dognin, Vaibhava Goel, John R. Hershey, Peder A. Olsen, A fast, accurate approximation to log likelihood of Gaussian mixture models international conference on acoustics, speech, and signal processing. pp. 3817- 3820 ,(2009) , 10.1109/ICASSP.2009.4960459
P.A. Olsen, R.A. Gopinath, Modeling inverse covariance matrices by basis expansion IEEE Transactions on Speech and Audio Processing. ,vol. 12, pp. 37- 46 ,(2004) , 10.1109/TSA.2003.819943
L.K. Saul, M.G. Rahim, Maximum likelihood and minimum classification error factor analysis for automatic speech recognition IEEE Transactions on Speech and Audio Processing. ,vol. 8, pp. 115- 125 ,(2000) , 10.1109/89.824696
V. Vanhoucke, A. Sankar, Mixtures of inverse covariances IEEE Transactions on Speech and Audio Processing. ,vol. 12, pp. 250- 264 ,(2004) , 10.1109/TSA.2004.825675