An Empirical Study of Training Self-Supervised Vision Transformers.

Authors: Xinlei Chen, Saining Xie, Kaiming He

DOI:

Keywords:

Abstract: This paper does not describe a novel method. Instead, it studies a straightforward, incremental, yet must-know baseline given the recent progress in computer vision: self …

References (45)
R. Hadsell, S. Chopra, Y. LeCun, Dimensionality Reduction by Learning an Invariant Mapping. Computer Vision and Pattern Recognition, vol. 2, pp. 1735-1742 (2006), 10.1109/CVPR.2006.100
Yann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, Lawrence D Jackel, Backpropagation Applied to Handwritten Zip Code Recognition. Neural Computation, vol. 1, pp. 541-551 (1989), 10.1162/NECO.1989.1.4.541
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition. Computer Vision and Pattern Recognition, pp. 770-778 (2016), 10.1109/CVPR.2016.90
Maria-Elena Nilsback, Andrew Zisserman, Automated Flower Classification over a Large Number of Classes. Indian Conference on Computer Vision, Graphics and Image Processing, pp. 722-729 (2008), 10.1109/ICVGIP.2008.47
Priya Goyal, Piotr Dollár, Ross B. Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, Kaiming He, Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour. arXiv: Computer Vision and Pattern Recognition (2017)
Yang You, Igor Gitman, Boris Ginsburg, Large Batch Training of Convolutional Networks. arXiv: Computer Vision and Pattern Recognition (2017)
Zhirong Wu, Yuanjun Xiong, Stella X. Yu, Dahua Lin, Unsupervised Feature Learning via Non-parametric Instance Discrimination. Computer Vision and Pattern Recognition, pp. 3733-3742 (2018), 10.1109/CVPR.2018.00393
Aaron van den Oord, Yazhe Li, Oriol Vinyals, Representation Learning with Contrastive Predictive Coding. arXiv: Learning (2018)
R. Devon Hjelm, Alex Fedorov, Samuel Lavoie-Marchildon, Karan Grewal, Philip Bachman, Adam Trischler, Yoshua Bengio, Learning Deep Representations by Mutual Information Estimation and Maximization. International Conference on Learning Representations (2018)
Ilya Loshchilov, Frank Hutter, Decoupled Weight Decay Regularization. International Conference on Learning Representations (2018)