An integrated model for effective saliency prediction

Authors: Heng Tao Shen, Xiaoshuai Sun, Hongzhi Yin, Zi Huang

Keywords: Machine learning; Artificial neural network; Feature learning; Normalization (image processing); Artificial intelligence; Salience (neuroscience); Salient; Computer science

Abstract: In this paper, we propose an integrated model of both semantic-aware and contrast-aware saliency (SCA) that combines bottom-up and top-down cues for effective eye fixation prediction. The model contains two pathways. The first pathway is a deep neural network customized for saliency, which aims to capture the semantic information in images, especially the presence of meaningful objects and object parts. The second pathway is based on on-line feature learning and information maximization; it learns an adaptive representation of the input and discovers high-contrast salient patterns within the image context. The two pathways characterize long-term and short-term attention cues, respectively, and are integrated using maxima normalization. Experimental results on artificial images and several benchmark datasets demonstrate superior performance and better plausibility compared with classic approaches and recent models.
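The abstract does not spell out how the two pathway outputs are combined beyond naming maxima normalization. The sketch below assumes the Itti-Koch style operator, in which a map is rescaled to [0, 1] and then weighted by the squared difference between its global maximum and the mean of its other local maxima, so that maps with one dominant peak are promoted and maps with many comparable peaks are suppressed. The function names (`rescale`, `local_maxima`, `maxima_normalize`, `fuse`) and the toy maps are hypothetical and stand in for the semantic and contrast pathway outputs; this is an illustration of the general technique, not the authors' implementation.

```python
def rescale(m):
    """Linearly rescale a 2-D map (list of rows) to the range [0, 1]."""
    flat = [v for row in m for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]


def local_maxima(m):
    """Return values of cells strictly greater than all of their neighbours."""
    h, w = len(m), len(m[0])
    peaks = []
    for y in range(h):
        for x in range(w):
            neighbours = [m[j][i]
                          for j in range(max(0, y - 1), min(h, y + 2))
                          for i in range(max(0, x - 1), min(w, x + 2))
                          if (j, i) != (y, x)]
            if neighbours and all(m[y][x] > n for n in neighbours):
                peaks.append(m[y][x])
    return peaks


def maxima_normalize(m):
    """Assumed Itti-Koch style N(.): scale by (global max - mean other maxima)^2."""
    m = rescale(m)
    global_max = max(v for row in m for v in row)
    others = [p for p in local_maxima(m) if p < global_max]
    mean_others = sum(others) / len(others) if others else 0.0
    weight = (global_max - mean_others) ** 2
    return [[v * weight for v in row] for row in m]


def fuse(semantic_map, contrast_map):
    """Sum the two normalized pathway maps and rescale the result to [0, 1]."""
    a = maxima_normalize(semantic_map)
    b = maxima_normalize(contrast_map)
    summed = [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]
    return rescale(summed)


# Toy inputs: one dominant peak versus two competing peaks.
semantic = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
contrast = [[1.0, 0.0, 0.9], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
fused = fuse(semantic, contrast)
```

In this toy case the semantic map keeps its full weight because it has a single peak, while the contrast map is strongly down-weighted because its two peaks are nearly equal, so the fused map is dominated by the centre location.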
