An approach of bag-of-words based on visual attention model for pornographic images recognition in compressed domain

作者: Jing Zhang , Lei Sui , Li Zhuo , Zhenwei Li , Yuncong Yang

DOI: 10.1016/J.NEUCOM.2012.11.029

关键词: Face (geometry)Computer visionImage (mathematics)Computer sciencePixelScale-invariant feature transformDomain (software engineering)Human visual system modelArtificial intelligenceBag-of-words modelFeature (computer vision)Pattern recognition

摘要: Bag-of-words (BoW) model has been widely used in pornographic images recognition and filtering. Most of existing methods create BoW from with a scale-invariant feature transform (SIFT) descriptor the pixel domain. These require extra processing time to decompress compressed formats. In addition, SIFT only views local points centers some regions as BoW, which ignores major role image region human visual system. Different above this paper, approach based on attention is proposed recognize domain, includes following steps: (1) face detected remove or ID photo benign images; (2) built according characteristics image; (3) are by domain; (4) four features color, texture, intensity skin extracted regions; (5) created k-means cluster (6) will be represent images. Experimental results show that can more accurately less computational time.

参考文章(32)
D.A. Forsyth, M.M. Fleck, Identifying nude pictures workshop on applications of computer vision. pp. 103- 108 ,(1996) , 10.1109/ACV.1996.572010
Margaret M. Fleck, David A. Forsyth, Chris Bregler, Finding Naked People european conference on computer vision. pp. 593- 602 ,(1996) , 10.1007/3-540-61123-1_173
Jing Zhang, Li Zhuo, Zhenwei Li, Lei Sui, Pornographic image region detection based on visual attention model in compressed domain Iet Image Processing. ,vol. 7, pp. 384- 391 ,(2013) , 10.1049/IET-IPR.2012.0381
Shiwei Zhao, Li Zhuo, Suyu Wang, Li Xiaoguang, Lansun Shen, Pornographic image recognition in compressed domain based on multi-cost sensitive decision tree international conference on computer science and information technology. ,vol. 4, pp. 225- 229 ,(2010) , 10.1109/ICCSIT.2010.5565198
Claudio Carpineto, Giovanni Romano, A Survey of Automatic Query Expansion in Information Retrieval ACM Computing Surveys. ,vol. 44, pp. 1- 50 ,(2012) , 10.1145/2071389.2071390
L. Sui, J. Zhang, L. Zhuo, Y.C. Yang, Research on pornographic images recognition method based on visual words in a compressed domain Iet Image Processing. ,vol. 6, pp. 87- 93 ,(2012) , 10.1049/IET-IPR.2011.0005
Yue Gao, Meng Wang, Zheng-Jun Zha, Jialie Shen, Xuelong Li, Xindong Wu, Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search IEEE Transactions on Image Processing. ,vol. 22, pp. 363- 376 ,(2013) , 10.1109/TIP.2012.2202676
Meng Wang, Bingbing Ni, Xian-Sheng Hua, Tat-Seng Chua, Assistive tagging: A survey of multimedia tagging with human-computer joint exploration ACM Computing Surveys. ,vol. 44, pp. 25- ,(2012) , 10.1145/2333112.2333120
Laurent Itti, Christof Koch, A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Research. ,vol. 40, pp. 1489- 1506 ,(2000) , 10.1016/S0042-6989(99)00163-7
Yue Gao, Meng Wang, Dacheng Tao, Rongrong Ji, Qionghai Dai, 3-D Object Retrieval and Recognition With Hypergraph Analysis IEEE Transactions on Image Processing. ,vol. 21, pp. 4290- 4303 ,(2012) , 10.1109/TIP.2012.2199502