Attention is not Explanation

作者: Byron C. Wallace , Sarthak Jain

DOI:

关键词:

摘要: … We report that neither property is consistently observed by a BiLSTM with a standard attention mechanism in the context of text classification, question answering (QA), and Natural …

参考文章(33)
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, Yoshua Bengio, None, Show, Attend and Tell: Neural Image Caption Generation with Visual Attention international conference on machine learning. ,vol. 3, pp. 2048- 2057 ,(2015)
Diederik P. Kingma, Jimmy Ba, Adam: A Method for Stochastic Optimization arXiv: Learning. ,(2014)
Armand Joulin, Sumit Chopra, Alexander M. Rush, Bart van Merriënboer, Tomas Mikolov, Jason Weston, Antoine Bordes, Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks arXiv: Artificial Intelligence. ,(2015)
Edward Grefenstette, Phil Blunsom, Karl Moritz Hermann, Tomáš Kočiský, Will Kay, Lasse Espeholt, Mustafa Suleyman, Teaching machines to read and comprehend neural information processing systems. ,vol. 28, pp. 1693- 1701 ,(2015)
Samuel R. Bowman, Gabor Angeli, Christopher Potts, Christopher D. Manning, A large annotated corpus for learning natural language inference empirical methods in natural language processing. pp. 632- 642 ,(2015) , 10.18653/V1/D15-1075
Andrew Y. Ng, Christopher Potts, Andrew L. Maas, Dan Huang, Peter T. Pham, Raymond E. Daly, Learning Word Vectors for Sentiment Analysis meeting of the association for computational linguistics. pp. 142- 150 ,(2011)
Azadeh Nikfarjam, Abeed Sarker, Karen O’Connor, Rachel Ginn, Graciela Gonzalez, Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features. Journal of the American Medical Informatics Association. ,vol. 22, pp. 671- 681 ,(2015) , 10.1093/JAMIA/OCU041
Richard Socher, Andrew Ng, Christopher Potts, Christopher D. Manning, Jason Chuang, Alex Perelygin, Jean Wu, Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank empirical methods in natural language processing. pp. 1631- 1642 ,(2013)
Alistair E.W. Johnson, Tom J. Pollard, Lu Shen, Li-wei H. Lehman, Mengling Feng, Mohammad Ghassemi, Benjamin Moody, Peter Szolovits, Leo Anthony Celi, Roger G. Mark, MIMIC-III, a freely accessible critical care database Scientific Data. ,vol. 3, pp. 160035- 160035 ,(2016) , 10.1038/SDATA.2016.35
Ankur Parikh, Oscar Täckström, Dipanjan Das, Jakob Uszkoreit, A Decomposable Attention Model for Natural Language Inference empirical methods in natural language processing. pp. 2249- 2255 ,(2016) , 10.18653/V1/D16-1244