作者: Yuchai Wan , Xiabi Liu , Kunqi Tong , Xue Wei , Yi Wu
DOI: 10.1007/978-3-642-34481-7_26
关键词: Gaussian 、 Mathematics 、 Nearest neighbor search 、 Minimum description length 、 Artificial intelligence 、 Mixture model 、 Data mining 、 Tree (data structure) 、 Search engine indexing 、 Image retrieval 、 Pattern recognition 、 Cluster analysis
摘要: This paper proposes a novel clustering based indexing approach called GMM-ClusterForest for supporting multi-features similarity search in high-dimensional spaces. We fit Gaussian Mixture Model (GMM) to data through the Expectation-Maximization (EM) algorithm estimating GMM parameters and Minimum Description Length (MDL) criterion selecting structure. Each component is taken as cluster center each point assigned according Bayesian decision rule. By performing this method hierarchically, an index tree constructed corresponding developed type of features. Then fulfilled by fusing trees all types features considered. evaluated proposed applying it example-based image retrieval conducting experiments on Corel 1000 dataset self-collected large dataset. The experimental results show that our effective promising.