Conjugate mixture models for clustering multimodal data

作者: Vasil Khalidov , Florence Forbes , Radu Horaud

DOI: 10.1162/NECO_A_00074

关键词: Modality (human–computer interaction)Model selectionPattern recognitionConvergence (routing)MathematicsArtificial intelligenceCluster analysisInitializationGlobal optimizationExpectation–maximization algorithmMixture model

摘要: The problem of multimodal clustering arises whenever the data are gathered with several physically different sensors. Observations from modalities not necessarily aligned in sense there is no obvious way to associate or compare them some common space. A solution may consist considering multiple tasks independently for each modality. main difficulty such an approach guarantee that unimodal clusterings mutually consistent. In this letter, we show can be addressed within a novel framework: conjugate mixture models. These models exploit explicit transformations often available between unobserved parameter space (objects) and observation spaces (sensors). We formulate as likelihood maximization task derive associated expectation-maximization algorithm. convergence properties proposed algorithm thoroughly investigated. Several local global optimization techniques order increase its speed. Two initialization strategies compared. consistent model selection criterion proposed. variants tested evaluated 3D localization speakers using both auditory visual data.

参考文章(60)
Murray Aitkin, Donald B. Rubin, Estimation and Hypothesis Testing in Finite Mixture Models Journal of the royal statistical society series b-methodological. ,vol. 47, pp. 67- 75 ,(1985) , 10.1111/J.2517-6161.1985.TB01331.X
José A. Castellanos, Juan D. Tardós, Mobile Robot Localization and Map Building Springer US. ,(1999) , 10.1007/978-1-4615-4405-0
B. G. Quinn, G. J. McLachlan, N. L. Hjort, A Note on the Aitkin‐Rubin Approach to Hypothesis Testing in Mixture Models Journal of the royal statistical society series b-methodological. ,vol. 49, pp. 311- 314 ,(1987) , 10.1111/J.2517-6161.1987.TB01700.X
Russell A. Boyles, On the Convergence of the EM Algorithm Journal of the royal statistical society series b-methodological. ,vol. 45, pp. 47- 50 ,(1983) , 10.1111/J.2517-6161.1983.TB01229.X
Rajive Joshi, Arthur C Sanderson, Multisensor Fusion: A Minimal Representation Framework ,(1999)
Christopher M. Bishop, Pattern Recognition and Machine Learning (Information Science and Statistics) Springer-Verlag New York, Inc.. ,(2006)
David A. Forsyth, Jean Ponce, Computer Vision: A Modern Approach ,(2002)
Jose A Castellanos, Juan D Tardos, None, Mobile Robot Localization and Map Building: A Multisensor Fusion Approach Kluwer Academic Publishers. ,(2000)
Michael I Jordan, Zoubin Ghahramani, Tommi S Jaakkola, Lawrence K Saul, None, An introduction to variational methods for graphical models Machine Learning. ,vol. 37, pp. 105- 161 ,(1999) , 10.1023/A:1007665907178