A framework for dynamic restructuring of semantic video analysis systems based on learning attention control

Authors: Mohamad-Hoseyn Sigari, Hamid Soltanian-Zadeh, Hamid-Reza Pourreza

DOI: 10.1016/J.IMAVIS.2015.07.004

Keywords: Structure (mathematical logic), Computer science, Restructuring, Q-learning, Artificial intelligence, Attentional control, Machine learning, Computational complexity theory, Event (computing)

Abstract: Current semantic video analysis systems are usually hierarchical and consist of several levels to overcome the gaps between low-level features and high-level concepts. In these systems, features, descriptors, objects or concepts are extracted in each level; therefore, the total computational complexity of such systems is huge. In this paper, we present a new general framework to impose attention control on such a system using Q-learning. Thus, our proposed framework restructures a given system dynamically to direct attention to the blocks extracting the most informative features/concepts, and reduces the computational complexity of the system. In other words, the framework actively directs the flow of processing using a learning method. The framework is evaluated for event detection in broadcast soccer videos with limited numbers of training samples. Experiments show that the framework is able to learn how to restructure the initial structure to reach the final goal with less computational complexity.
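
The idea of learning which processing blocks to run can be illustrated with a toy tabular Q-learning sketch. This is not the authors' system: the block names, costs, rewards and the rule that only the hypothetical `motion` block is informative are all assumptions made up for illustration. The agent pays a cost for each block it executes and is rewarded when it stops with enough information to decide.

```python
import random
from collections import defaultdict

# Hypothetical processing blocks and computational costs (illustrative values only).
BLOCK_COSTS = {"color": 1.0, "motion": 2.0, "audio": 4.0}
STOP = "stop"


def available_actions(state):
    """Blocks not yet executed, plus the option to stop and emit a decision."""
    return [b for b in BLOCK_COSTS if b not in state] + [STOP]


def step(state, action):
    """Toy environment (assumption): the decision is correct only if 'motion' was run."""
    if action == STOP:
        reward = 10.0 if "motion" in state else -10.0
        return state, reward, True
    # Running a block costs computation and adds it to the set of executed blocks.
    return state | {action}, -BLOCK_COSTS[action], False


def q_learning(episodes=5000, alpha=0.1, gamma=1.0, epsilon=0.2, seed=0):
    """Standard tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = defaultdict(float)  # maps (state, action) to an estimated return
    for _ in range(episodes):
        state, done = frozenset(), False
        while not done:
            actions = available_actions(state)
            if rng.random() < epsilon:
                action = rng.choice(actions)
            else:
                action = max(actions, key=lambda a: q[(state, a)])
            next_state, reward, done = step(state, action)
            best_next = 0.0 if done else max(
                q[(next_state, a)] for a in available_actions(next_state)
            )
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = next_state
    return q


def greedy_plan(q):
    """Follow the learned greedy policy from the empty state until it stops."""
    state, plan = frozenset(), []
    while True:
        action = max(available_actions(state), key=lambda a: q[(state, a)])
        plan.append(action)
        if action == STOP:
            return plan
        state = state | {action}


if __name__ == "__main__":
    print(greedy_plan(q_learning()))
```

In this toy setting the learned policy skips the blocks that add cost without improving the decision, which is the kind of complexity reduction the abstract describes at a much larger scale.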

References (54)
Nobuyuki Otsu, A Threshold Selection Method from Gray-Level Histograms. IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, pp. 62-66 (1979). DOI: 10.1109/TSMC.1979.4310076
Dian W. Tjondronegoro, Yi-Ping Phoebe Chen, Knowledge-Discounted Event Detection in Sports Video. Systems, Man, and Cybernetics, vol. 40, pp. 1009-1024 (2010). DOI: 10.1109/TSMCA.2010.2046729
A. Ekin, A.M. Tekalp, R. Mehrotra, Automatic soccer video analysis and summarization. IEEE Transactions on Image Processing, vol. 12, pp. 796-807 (2003). DOI: 10.1109/TIP.2003.812758
Mei-Ling Shyu, Zongxing Xie, Min Chen, Shu-Ching Chen, Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework. IEEE Transactions on Multimedia, vol. 10, pp. 252-259 (2008). DOI: 10.1109/TMM.2007.911830
Anne M. Treisman, Garry Gelade, A feature-integration theory of attention. Cognitive Psychology, vol. 12, pp. 97-136 (1980). DOI: 10.1016/0010-0285(80)90005-5
T. D'Orazio, M. Leo, P. Spagnolo, P.L. Mazzeo, N. Mosca, M. Nitti, A. Distante, An Investigation Into the Feasibility of Real-Time Soccer Offside Detection From a Multiple Camera System. IEEE Transactions on Circuits and Systems for Video Technology, vol. 19, pp. 1804-1818 (2009). DOI: 10.1109/TCSVT.2009.2026817
A. Cavallaro, O. Steiger, T. Ebrahimi, Semantic video analysis for adaptive content delivery and automatic description. IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, pp. 1200-1209 (2005). DOI: 10.1109/TCSVT.2005.854240
Evaggelos Spyrou, Giorgos Tolias, Phivos Mylonas, Yannis Avrithis, Concept detection and keyframe extraction using a visual thesaurus. Multimedia Tools and Applications, vol. 41, pp. 337-373 (2009). DOI: 10.1007/S11042-008-0237-9
Ali Borji, Laurent Itti, State-of-the-Art in Visual Attention Modeling. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, pp. 185-207 (2013). DOI: 10.1109/TPAMI.2012.89
Jialie Shen, Dacheng Tao, Xuelong Li, Modality Mixture Projections for Semantic Video Event Detection. IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, pp. 1587-1596 (2008). DOI: 10.1109/TCSVT.2008.2005607