Long-Term Feature Banks for Detailed Video Understanding.

Authors: Philipp Krähenbühl, Kaiming He, Christoph Feichtenhofer, Ross Girshick, Chao-Yuan Wu


Abstract: … -term feature bank to explicitly enable these interactions. … feature bank L and a feature bank operator FBO(S, L) that computes interactions between the short-term and long-term features…
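To make the notation concrete, the following is a minimal sketch of what an operator of the form FBO(S, L) could look like. The abstract excerpt does not say how the operator is built, so everything here is an assumption for illustration: the function name `fbo_attention`, the choice of scaled dot-product attention, and the decision to concatenate each short-term feature with its attended long-term context are hypothetical, and the feature vectors are toy values.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fbo_attention(S, L):
    """Hypothetical feature bank operator FBO(S, L) (illustrative assumption).

    S: list of short-term feature vectors, each a list of d floats.
    L: list of long-term feature vectors (the bank), each a list of d floats.
    Returns one vector of length 2*d per short-term feature: the original
    feature concatenated with an attention-weighted average of the bank.
    """
    d = len(S[0])
    scale = 1.0 / math.sqrt(d)
    fused = []
    for s in S:
        # Scaled dot-product similarity between s and every bank entry.
        scores = [scale * sum(a * b for a, b in zip(s, l)) for l in L]
        weights = softmax(scores)
        # Weighted average of the long-term features (the attended context).
        context = [sum(w * l[k] for w, l in zip(weights, L)) for k in range(d)]
        fused.append(list(s) + context)
    return fused


# Toy example: two short-term features and a bank of three long-term features.
S = [[1.0, 0.0], [0.0, 1.0]]
L = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused = fbo_attention(S, L)
```

Each row of `fused` keeps the short-term feature in its first `d` entries and appends a context vector drawn from the long-term bank, so a downstream classifier would see both. Simpler choices, such as averaging the bank without attention, fit the same FBO(S, L) signature.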
