RGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge

作者: Jinhui Tang , Lu Jin , Zechao Li , Shenghua Gao

DOI: 10.1109/TMM.2015.2476660

关键词:

摘要: For the task of RGB-D object recognition, it is important to identify suitable representations images, which can boost performance recognition. In this work, we propose a novel representation learning method for images by jointly incorporating underlying data structure and prior knowledge data. Specifically, convolutional neural networks (CNN) are employed learn image exploiting structure. To handle problem limited RGB depth multi-level hierarchies features trained on ImageNet from CNN transferred rich generic feature while labeled leveraged. On other hand, deep auto-encoders (DAE) exploit knowledge, overcome expensive computational cost optimization in encoding. The expected obtained integrating two types representations. verify effectiveness proposed method, thoroughly conduct extensive experiments publicly available datasets. encouraging experimental results compared with state-of-the-art approaches demonstrate advantages method.

参考文章(47)
Matthew Zeiler, Rob Fergus, Li Wan, Yann Le Cun, Sixin Zhang, Regularization of Neural Networks using DropConnect international conference on machine learning. pp. 1058- 1066 ,(2013)
Anran Wang, Jiwen Lu, Gang Wang, Jianfei Cai, Tat-Jen Cham, None, Multi-modal Unsupervised Feature Learning for RGB-D Scene Labeling Computer Vision – ECCV 2014. pp. 453- 467 ,(2014) , 10.1007/978-3-319-10602-1_30
Naiyan Wang, Siyi Li, Abhinav Gupta, Dit-Yan Yeung, None, Transferring Rich Feature Hierarchies for Robust Visual Tracking arXiv: Computer Vision and Pattern Recognition. ,(2015)
Saurabh Gupta, Ross Girshick, Pablo Arbeláez, Jitendra Malik, Learning Rich Features from RGB-D Images for Object Detection and Segmentation european conference on computer vision. pp. 345- 360 ,(2014) , 10.1007/978-3-319-10584-0_23
Liefeng Bo, Xiaofeng Ren, Dieter Fox, Unsupervised Feature Learning for RGB-D Based Object Recognition Experimental Robotics. pp. 387- 402 ,(2013) , 10.1007/978-3-319-00065-7_27
Herbert Bay, Tinne Tuytelaars, Luc Van Gool, SURF: speeded up robust features european conference on computer vision. ,vol. 1, pp. 404- 417 ,(2006) , 10.1007/11744023_32
Shenghua Gao, Ivor Wai-Hung Tsang, Liang-Tien Chia, Kernel sparse representation for image classification and face recognition european conference on computer vision. pp. 1- 14 ,(2010) , 10.1007/978-3-642-15561-1_1
Ilya Sutskever, Geoffrey E. Hinton, Alex Krizhevsky, Ruslan R. Salakhutdinov, Nitish Srivastava, Improving neural networks by preventing co-adaptation of feature detectors arXiv: Neural and Evolutionary Computing. ,(2012)
Lu Jin, Shenghua Gao, Zechao Li, Jinhui Tang, Hand-Crafted Features or Machine Learnt Features? Together They Improve RGB-D Object Recognition international symposium on multimedia. pp. 311- 319 ,(2014) , 10.1109/ISM.2014.56
Clement Farabet, Camille Couprie, Laurent Najman, Yann LeCun, Learning Hierarchical Features for Scene Labeling IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 35, pp. 1915- 1929 ,(2013) , 10.1109/TPAMI.2012.231