Learning to Learn from Noisy Web Videos

作者: Serena Yeung , Vignesh Ramanathan , Olga Russakovsky , Liyue Shen , Greg Mori

DOI: 10.1109/CVPR.2017.788

关键词:

摘要: Understanding the simultaneously very diverse and intricately fine-grained set of possible human actions is a critical open problem in computer vision. Manually labeling training videos feasible for some action classes but doesnt scale to full long-tailed distribution actions. A promising way address this leverage noisy data from web queries learn new actions, using semi-supervised or webly-supervised approaches. However, these methods typically do not domain-specific knowledge, rely on iterative hand-tuned policies. In work, we instead propose reinforcement learning-based formulation selecting right examples classifier search results. Our method uses Q-learning policy small labeled dataset, then automatically label visual concepts. Experiments challenging Sports-1M recognition benchmark as well additional newly emerging demonstrate that our able good policies use accurate concept classifiers.

参考文章(4)
Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition Proceedings of the IEEE. ,vol. 86, pp. 2278- 2324 ,(1998) , 10.1109/5.726791
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis, None, Human-level control through deep reinforcement learning Nature. ,vol. 518, pp. 529- 533 ,(2015) , 10.1038/NATURE14236
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition computer vision and pattern recognition. pp. 770- 778 ,(2016) , 10.1109/CVPR.2016.90
Gabriel Dulac-Arnold, Peter Sunehag, Ben Coppin, Richard Evans, Reinforcement Learning in Large Discrete Action Spaces. arXiv: Artificial Intelligence. ,(2015)