Soft Transfer Learning via Gradient Diagnosis for Visual Relationship Detection

Authors: Diqi Chen, Xiaodan Liang, Yizhou Wang, Wen Gao

DOI: 10.1109/WACV.2019.00124

Keywords: Context (language use), Task (project management), Object detection, Feature extraction, Transfer of learning, Machine learning, Artificial intelligence, Task analysis, Visualization, Knowledge transfer, Computer science

Abstract: Detecting all visual relationships is posed as the most fundamental task towards ultimate semantic reasoning. However, due to the rich context embedded in the image and diverse language ambiguities, it is unrealistic to annotate the list of all possible relationships for providing a noise-free supervised setting. All prior approaches simply adopt the traditional fully-supervised detection pipeline and ignore the effect of incomplete annotations on model convergence, resulting in unstable optimization and unsatisfactory performance. In this work, we make the first attempt to address this critical issue and reformulate visual relationship detection via Soft Transfer Learning (STL), which aims to transfer knowledge learned from the annotations at hand into uncertain pairs in a self-supervised way. The transfer process is inferred by a principled gradient diagnosis. Extensive experiments on the VRD and large-scale VG benchmarks demonstrate the superiority of our STL method.
