An Enhanced Bag-of-Visual Word Vector Space Model to Represent Visual Content in Athletics Images

作者: Kraisak Kesorn , Stefan Poslad

DOI: 10.1109/TMM.2011.2170665

关键词:

摘要: Images that have a different visual appearance may be semantically related using higher level conceptualization. However, image classification and retrieval systems tend to rely only on the low-level structure within images. This paper presents framework deal with this semantic gap limitation by exploiting well-known bag-of-visual words (BVW) represent content. The novelty of is threefold. First, quality improved constructing from representative keypoints. Second, domain specific “non-informative words” are detected which useless content data but can degrade categorization capability. Distinct existing frameworks, two main characteristics for non-informative defined: high document frequency (DF) small statistical association all concepts in collection. third contribution novel method used restructure vector space model respect structural ontology order resolve synonym polysemy problems. experimental results show our disambiguate word senses effectively significantly improve classification, interpretation, performance athletics

参考文章(39)
Faraj Alhwarin, Chao Jie Wang, Danijela Ristic-Durrant, Axel Gräser, Improved SIFT-features matching for object recognition VoCS'08 Proceedings of the 2008 international conference on Visions of Computer Science: BCS International Academic Conference. pp. 179- 190 ,(2008) , 10.14236/EWIC/VOCS2008.16
Ron S. Kenett, Frederick W. Faltin, Fabrizio Ruggeri, editors-in-chief, Encyclopedia of statistics in quality and reliability : John Wiley. ,(2007)
Lihua Wang, Zhiwu Lu, Horace H. S. Ip, Image Categorization Based on a Hierarchical Spatial Markov Model Computer Analysis of Images and Patterns. ,vol. 5702, pp. 766- 773 ,(2009) , 10.1007/978-3-642-03767-2_93
Yongjin Lee, Kyunghee Lee, Sungbum Pan, Local and global feature extraction for face recognition Lecture Notes in Computer Science. pp. 219- 228 ,(2005) , 10.1007/11527923_23
Hinrich Schütze, Christopher D. Manning, Prabhakar Raghavan, Introduction to Information Retrieval ,(2005)
Shahar Jamshy, Eyal Krupka, Yehezkel Yeshurun, Reducing Keypoint Database Size Image Analysis and Processing – ICIAP 2009. pp. 113- 122 ,(2009) , 10.1007/978-3-642-04146-4_14
Dan Pelleg, Andrew W. Moore, X-means: Extending K-means with Efficient Estimation of the Number of Clusters international conference on machine learning. pp. 727- 734 ,(2000)
Olga Russakovsky, Li Fei-Fei, Attribute learning in large-scale datasets european conference on computer vision. pp. 1- 14 ,(2010) , 10.1007/978-3-642-35749-7_1