Learning Personalized Modular Network Guided by Structured Knowledge

作者: Xiaodan Liang

DOI: 10.1109/CVPR.2019.00915

关键词:

摘要: The dominant deep learning approaches use a "one-size-fits-all" paradigm with the hope that underlying characteristics of diverse inputs can be captured via fixed structure. They also overlook importance explicitly modeling feature hierarchy. However, complex real-world tasks often require discovering reasoning paths for different to achieve satisfying predictions, especially challenging large-scale recognition label relations. In this paper, we treat structured commonsense knowledge (e.g. concept hierarchy) as guidance customizing more powerful and explainable network structures distinct inputs, leading dynamic individualized inference paths. Give an off-the-shelf large configuration, proposed Personalized Modular Network (PMN) is learned by selectively activating sequence modules where each them designated recognize particular levels knowledge. Learning semantic configurations activation align well regarded decision-making procedure, which solved new graph-based reinforcement algorithm. Experiments on three segmentation classification show our PMN superior performance reduced number while personalized module input.

参考文章(36)
Jifeng Dai, Kaiming He, Jian Sun, BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation international conference on computer vision. pp. 1635- 1643 ,(2015) , 10.1109/ICCV.2015.191
Yoshua Bengio, Deep Learning of Representations: Looking Forward Statistical Language and Speech Processing. pp. 1- 37 ,(2013) , 10.1007/978-3-642-39593-2_1
Andrew Rabinovich, Wei Liu, Alexander C. Berg, ParseNet: Looking Wider to See Better arXiv: Computer Vision and Pattern Recognition. ,(2015)
Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation computer vision and pattern recognition. pp. 3431- 3440 ,(2015) , 10.1109/CVPR.2015.7298965
Sepp Hochreiter, Jürgen Schmidhuber, Long short-term memory Neural Computation. ,vol. 9, pp. 1735- 1780 ,(1997) , 10.1162/NECO.1997.9.8.1735
David Schiebener, Jun Morimoto, Tamim Asfour, Aleš Ude, Integrating visual perception and manipulation for autonomous learning of object representations Adaptive Behavior. ,vol. 21, pp. 328- 345 ,(2013) , 10.1177/1059712313484502
Alexander G. Schwing, Raquel Urtasun, Fully Connected Deep Structured Networks arXiv: Computer Vision and Pattern Recognition. ,(2015)
John Langford, Tong Zhang, The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information neural information processing systems. ,vol. 20, pp. 817- 824 ,(2007)
A.G. Barto, R.S. Sutton, Reinforcement Learning: An Introduction ,(1988)
Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr, Conditional Random Fields as Recurrent Neural Networks international conference on computer vision. pp. 1529- 1537 ,(2015) , 10.1109/ICCV.2015.179