摘要: Abstract We propose UniT, a Unified Transformer model to simultaneously learn the most prominent tasks across different domains, ranging from object detection to natural language …
Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick, None, Mask R-CNN2017 IEEE International Conference on Computer Vision (ICCV). pp. 2980- 2988 ,(2017) , 10.1109/ICCV.2017.322
Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, Illia Polosukhin, None, Attention is All You Needneural information processing systems. ,vol. 30, pp. 5998- 6008 ,(2017)