Can You Tell Me How to Get Past Sesame Street? Sentence-Level Pretraining Beyond Language Modeling

Authors: Samuel R. Bowman, Ellie Pavlick, Benjamin Van Durme, Edouard Grave, Patrick Xia

DOI:

Keywords:

Abstract: Natural language understanding has recently seen a surge of progress with the use of sentence encoders like ELMo (Peters et al., 2018a) and BERT (Devlin et al., 2019) which …

References (55)
William B. Dolan, Chris Brockett. Automatically Constructing a Corpus of Sentential Paraphrases. Proceedings of the Third International Workshop on Paraphrasing (IWP2005) (2005).
Ryan Kiros, Yukun Zhu, Ruslan R Salakhutdinov, Richard Zemel, Raquel Urtasun, Antonio Torralba, Sanja Fidler. Skip-thought vectors. Neural Information Processing Systems, vol. 28, pp. 3294-3302 (2015).
Leora Morgenstern, Ernest Davis, Hector J. Levesque. The Winograd schema challenge. Principles of Knowledge Representation and Reasoning, pp. 552-561 (2012).
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz. Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, vol. 19, pp. 313-330 (1993). DOI: 10.21236/ADA273556
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, C. Lawrence Zitnick. Microsoft COCO: Common Objects in Context. Computer Vision – ECCV 2014, pp. 740-755 (2014). DOI: 10.1007/978-3-319-10602-1_48
Sepp Hochreiter, Jürgen Schmidhuber. Long short-term memory. Neural Computation, vol. 9, pp. 1735-1780 (1997). DOI: 10.1162/NECO.1997.9.8.1735
Ronan Collobert, Pavel Kuksa, Léon Bottou, Koray Kavukcuoglu, Michael Karlen, Jason Weston. Natural Language Processing (Almost) from Scratch. Journal of Machine Learning Research, vol. 12, pp. 2493-2537 (2011).
Julia Hockenmaier, Mark Steedman. CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank. Computational Linguistics, vol. 33, pp. 355-396 (2007). DOI: 10.1162/COLI.2007.33.3.355
Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, Hal Daumé III. Deep Unordered Composition Rivals Syntactic Methods for Text Classification. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), vol. 1, pp. 1681-1691 (2015). DOI: 10.3115/V1/P15-1162
Danqi Chen, Christopher Manning. A Fast and Accurate Dependency Parser using Neural Networks. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 740-750 (2014). DOI: 10.3115/V1/D14-1082