attr2vec: Jointly Learning Word and Contextual Attribute Embeddings with Factorization Machines

作者: Fabio Petroni , Vassilis Plachouras , Timothy Nugent , Jochen L. Leidner

DOI: 10.18653/V1/N18-1042

关键词:

摘要: The widespread use of word embeddings is associated with the recent successes many natural language processing (NLP) systems. key approach popular models such as word2vec and GloVe to learn dense vector representations from context words. More recently, other approaches have been proposed that incorporate different types contextual information, including topics, dependency relations, n-grams, sentiment. However, these typically integrate only limited additional often in ad hoc ways. In this work, we introduce attr2vec, a novel framework for jointly learning words attributes based on factorization machines. We perform experiments information. Our experimental results text classification task demonstrate using attr2vec Part-of-Speech (POS) tags improves compared independently. Moreover, train dependency-based show they exhibit higher similarity between functionally related traditional approaches.

参考文章(36)
Jiwei Li, Dan Jurafsky, Do Multi-Sense Embeddings Improve Natural Language Understanding? empirical methods in natural language processing. pp. 1722- 1732 ,(2015) , 10.18653/V1/D15-1200
Tomas Mikolov, Greg S. Corrado, Kai Chen, Jeffrey Dean, Efficient Estimation of Word Representations in Vector Space international conference on learning representations. ,(2013)
Yoon Kim, Convolutional Neural Networks for Sentence Classification empirical methods in natural language processing. pp. 1746- 1751 ,(2014) , 10.3115/V1/D14-1181
Albert Weichselbraun, Stefan Gindl, Arno Scharl, Extracting and Grounding Contextualized Sentiment Lexicons IEEE Intelligent Systems. ,vol. 28, pp. 39- 46 ,(2013) , 10.1109/MIS.2013.41
Steffen Rendle, Zeno Gantner, Christoph Freudenthaler, Lars Schmidt-Thieme, Fast context-aware recommendations with factorization machines international acm sigir conference on research and development in information retrieval. pp. 635- 644 ,(2011) , 10.1145/2009916.2010002
Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, Eytan Ruppin, Placing search in context Proceedings of the tenth international conference on World Wide Web - WWW '01. pp. 406- 414 ,(2001) , 10.1145/371920.372094
Steffen Rendle, Factorization Machines with libFM ACM Transactions on Intelligent Systems and Technology. ,vol. 3, pp. 57- ,(2012) , 10.1145/2168752.2168771
Andriy Mnih, Koray Kavukcuoglu, Learning word embeddings efficiently with noise-contrastive estimation neural information processing systems. ,vol. 26, pp. 2265- 2273 ,(2013)
Hieu Hoang, Philipp Koehn, Factored Translation Models empirical methods in natural language processing. pp. 868- 876 ,(2007)
Ilya Sutskever, Quoc V. Le, Oriol Vinyals, Sequence to Sequence Learning with Neural Networks neural information processing systems. ,vol. 27, pp. 3104- 3112 ,(2014)