Improving Cross-Domain Concept Detection via Object-Based Features

作者: Markus Mühling , Ralph Ewerth , Bernd Freisleben

DOI: 10.1007/978-3-319-23117-4_31

关键词: Computer scienceMultiple kernel learningObject detectionGeneralizationComputer visionConvolutional neural networkDomain (software engineering)Visual appearanceArtificial intelligenceTRECVIDObject (computer science)

摘要: Learned visual concept models often do not work well for other domains considered during training, because a concept's appearance strongly depends on the domain of corresponding image or video source. In this paper, novel approach to improve cross-domain detection is presented. The proposed uses features based object results in addition Bag-of-Visual-Words as inputs classifiers. Experiments conducted TRECVid videos using high-performance computing cluster show that additional use object-based significantly improves generalization properties learned settings, example, from broadcast news documentary films and vice versa.

参考文章(25)
Gunnar Rätsch, Alexander Zien, Sören Sonnenburg, Christian Widmer, Christian Gehl, Vojtěch Franc, Jonas Behr, Fabio de Bona, Alexander Binder, Sebastian Henschel, The SHOGUN Machine Learning Toolbox Journal of Machine Learning Research. ,vol. 11, pp. 1799- 1802 ,(2010) , 10.5555/1756006.1859911
Emine Yilmaz, Javed A. Aslam, Estimating average precision with incomplete and imperfect judgments Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06. pp. 102- 111 ,(2006) , 10.1145/1183614.1183633
Jun Yang, Rong Yan, Alexander G. Hauptmann, Cross-domain video concept detection using adaptive svms Proceedings of the 15th international conference on Multimedia - MULTIMEDIA '07. pp. 188- 197 ,(2007) , 10.1145/1291233.1291276
Jingdong Wang, Yinghai Zhao, Xiuqing Wu, Xian-Sheng Hua, Transductive multi-label learning for video concept detection Proceeding of the 1st ACM international conference on Multimedia information retrieval - MIR '08. pp. 298- 304 ,(2008) , 10.1145/1460096.1460145
Alexander Hauptmann, Rong Yan, Wei-Hao Lin, How many high-level concepts will fill the semantic gap in news video retrieval? conference on image and video retrieval. pp. 627- 634 ,(2007) , 10.1145/1282280.1282369
Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, Andrew Zisserman, The Pascal Visual Object Classes (VOC) Challenge International Journal of Computer Vision. ,vol. 88, pp. 303- 338 ,(2010) , 10.1007/S11263-009-0275-4
Pedro F. Felzenszwalb, Ross B. Girshick, David McAllester, Cascade object detection with deformable part models computer vision and pattern recognition. pp. 2241- 2248 ,(2010) , 10.1109/CVPR.2010.5539906
Huan Li, Yuan Shi, Yang Liu, Alexander G Hauptmann, Zhang Xiong, None, Cross-domain video concept detection: A joint discriminative and generative active learning approach Expert Systems With Applications. ,vol. 39, pp. 12220- 12228 ,(2012) , 10.1016/J.ESWA.2012.04.054
Jun Yang, Alexander G. Hauptmann, (Un)Reliability of video concept detection Proceedings of the 2008 international conference on Content-based image and video retrieval - CIVR '08. pp. 85- 94 ,(2008) , 10.1145/1386352.1386367
Ralph Ewerth, Markus Mühling, Bernd Freisleben, Robust Video Content Analysis via Transductive Learning ACM Transactions on Intelligent Systems and Technology. ,vol. 3, pp. 41- ,(2012) , 10.1145/2168752.2168755