Predictive and generative neural networks for object functionality

作者: Ruizhen Hu , Zihao Yan , Jingwen Zhang , Oliver Van Kaick , Ariel Shamir

DOI: 10.1145/3197517.3201287

关键词:

摘要: Humans can predict the functionality of an object even without any surroundings, since their knowledge and experience would allow them to "hallucinate" interaction or usage scenarios involving object. We develop predictive generative deep convolutional neural networks replicate this feat. Specifically, our work focuses on functionalities man-made 3D objects characterized by human-object object-object interactions. Our are trained a database scene contexts, called each consisting central one more surrounding objects, that represent functionalities. Given in isolation, functional similarity network (fSIM-NET), variation triplet network, is inferring functionality-revealing contexts. fSIM-NET complemented (iGEN-NET) segmentation (iSEG-NET). iGEN-NET takes single voxelized with label synthesizes surround, i.e., context which visually demonstrates corresponding functionality. iSEG-NET further separates interacting into different groups according types.

参考文章(38)
Koray Kavukcuoglu, Max Jaderberg, Karen Simonyan, Andrew Zisserman, Spatial transformer networks neural information processing systems. ,vol. 28, pp. 2017- 2025 ,(2015)
Yuke Zhu, Alireza Fathi, Li Fei-Fei, Reasoning about Object Affordances in a Knowledge Base Representation Computer Vision – ECCV 2014. pp. 408- 424 ,(2014) , 10.1007/978-3-319-10605-2_27
Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, Jianxiong Xiao, 3D ShapeNets: A deep representation for volumetric shapes computer vision and pattern recognition. pp. 1912- 1920 ,(2015) , 10.1109/CVPR.2015.7298801
Yixin Zhu, Yibiao Zhao, Song-Chun Zhu, Understanding tools: Task-oriented object modeling, learning and recognition computer vision and pattern recognition. pp. 2855- 2864 ,(2015) , 10.1109/CVPR.2015.7298903
Xi Zhao, He Wang, Taku Komura, Indexing 3D Scenes Using the Interaction Bisector Surface ACM Transactions on Graphics. ,vol. 33, pp. 22- ,(2014) , 10.1145/2574860
Jiang Wang, Yang Song, Thomas Leung, Chuck Rosenberg, Jingbin Wang, James Philbin, Bo Chen, Ying Wu, Learning Fine-Grained Image Similarity with Deep Ranking computer vision and pattern recognition. pp. 1386- 1393 ,(2014) , 10.1109/CVPR.2014.180
Y. Wang, K. Xu, J. Li, H. Zhang, A. Shamir, L. Liu, Z. Cheng, Y. Xiong, Symmetry Hierarchy of Man‐Made Objects Computer Graphics Forum. ,vol. 30, pp. 287- 296 ,(2011) , 10.1111/J.1467-8659.2011.01885.X
Youyi Zheng, Daniel Cohen-Or, Niloy J. Mitra, Smart Variations: Functional Substructures for Part Compatibility Computer Graphics Forum. ,vol. 32, pp. 195- 204 ,(2013) , 10.1111/CGF.12039
L. Stark, K. Bowyer, Achieving generalized object recognition through reasoning about association of function to structure IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 13, pp. 1097- 1104 ,(1991) , 10.1109/34.99242
Helmut Grabner, Juergen Gall, Luc Van Gool, What makes a chair a chair computer vision and pattern recognition. pp. 1529- 1536 ,(2011) , 10.1109/CVPR.2011.5995327