An Empirical Study of Training Self-Supervised Vision Transformers.

作者： Kaiming He , Saining Xie , Xinlei Chen

DOI:

关键词:

摘要: This paper does not describe a novel method. Instead, it studies a straightforward, incremental, yet must-know baseline given the recent progress in computer vision: self …

uni-trier.de 本地加速

arxiv.org 本地加速

arxiv-vanity.com 本地加速

thecvf.com 本地加速

arxiv.org PDF 下载加速

thecvf.com PDF 下载加速

参考文章(45)

R. Hadsell, S. Chopra, Y. LeCun, Dimensionality Reduction by Learning an Invariant Mapping computer vision and pattern recognition. ,vol. 2, pp. 1735- 1742 ,(2006) , 10.1109/CVPR.2006.100

Yann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, Lawrence D Jackel, None, Backpropagation applied to handwritten zip code recognition Neural Computation. ,vol. 1, pp. 541- 551 ,(1989) , 10.1162/NECO.1989.1.4.541

Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition computer vision and pattern recognition. pp. 770- 778 ,(2016) , 10.1109/CVPR.2016.90

Maria-Elena Nilsback, Andrew Zisserman, Automated Flower Classification over a Large Number of Classes indian conference on computer vision, graphics and image processing. pp. 722- 729 ,(2008) , 10.1109/ICVGIP.2008.47

Aapo Kyrola, Piotr Dollár, Lukasz Wesolowski, Yangqing Jia, Andrew Tulloch, Kaiming He, Ross B. Girshick, Priya Goyal, Pieter Noordhuis, Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour arXiv: Computer Vision and Pattern Recognition. ,(2017)

Yang You, Boris Ginsburg, Igor Gitman, Large Batch Training of Convolutional Networks arXiv: Computer Vision and Pattern Recognition. ,(2017)

Zhirong Wu, Yuanjun Xiong, Stella X. Yu, Dahua Lin, Unsupervised Feature Learning via Non-parametric Instance Discrimination computer vision and pattern recognition. pp. 3733- 3742 ,(2018) , 10.1109/CVPR.2018.00393

Aaron van den Oord, Yazhe Li, Oriol Vinyals, Representation Learning with Contrastive Predictive Coding arXiv: Learning. ,(2018)

Yoshua Bengio, Philip Bachman, Adam Trischler, R. Devon Hjelm, Karan Grewal, Samuel Lavoie-Marchildon, Alex Fedorov, Learning deep representations by mutual information estimation and maximization international conference on learning representations. ,(2018)

10.

Frank Hutter, Ilya Loshchilov, Decoupled Weight Decay Regularization. international conference on learning representations. ,(2018)

An Empirical Study of Training Self-Supervised Vision Transformers.

来源期刊

我的账户

An Empirical Study of Training Self-Supervised Vision Transformers.

来源期刊

相似文章 7

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.

VidTr: Video Transformer Without Convolutions.

Self-Supervised Learning with Swin Transformers.

IAML Distill Blog: Transformers in Vision

Divide and Contrast: Self-supervised Learning from Uncurated Data.

Salient Objects in Clutter.

CogView: Mastering Text-to-Image Generation via Transformers.

我的账户