A constrained optimization algorithm for learning GloVe embeddings with semantic lexicons

Authors: Flora Sakketou, Nicholas Ampazis

DOI: 10.1016/J.KNOSYS.2020.105628

Keywords: Constrained optimization; Text mining; Leverage (statistics); Algorithm; Sentiment analysis; Computer science; Semantic property

Abstract: GloVe representations of words as vector embeddings in continuous spaces are learned from matrix factorization of the words’ co-occurrences constructed from large corpora. Due to their high quality as textual features, GloVe embeddings have been extensively utilized for many text mining and natural language processing tasks with considerable success. Further improvements of these word embeddings can be obtained by also taking into account valuable information about the semantic properties of words and the complex relationships between them, as provided by semantic lexicons. In this paper we adopt optimization techniques from the domain of machine learning with constrained optimization in order to leverage the relational knowledge between words, and we propose an efficient algorithm that produces word embeddings enhanced by semantic information. The proposed algorithm outperforms other related approaches that utilize semantic information either during training or as a post-processing step. Our claims are validated by experiments on popular text mining and natural language processing tasks, including word similarities, word analogies, and sentiment analysis, which demonstrate that our model can significantly improve the quality of word vector representations.
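
The abstract refers to two ingredients: the GloVe factorization of co-occurrence counts and relational knowledge from a semantic lexicon. As a rough illustration only, the sketch below computes the standard GloVe weighted least-squares objective together with a simple measure of how far apart lexicon-related words lie. The helper `lexicon_gap`, the data layout, and all variable names are assumptions made for this example; the record does not describe the paper's constrained algorithm, and this is not it.

```python
import math

def glove_weight(x, x_max=100.0, alpha=0.75):
    """Standard GloVe weighting f(x): damps rare pairs, caps frequent ones at 1."""
    return (x / x_max) ** alpha if x < x_max else 1.0

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def glove_loss(cooc, W, C, bw, bc):
    """Weighted least-squares GloVe objective over nonzero co-occurrence counts.

    cooc: dict mapping (i, j) -> count X_ij > 0
    W, C: word and context vectors (lists of lists); bw, bc: bias lists
    """
    total = 0.0
    for (i, j), x in cooc.items():
        err = dot(W[i], C[j]) + bw[i] + bc[j] - math.log(x)
        total += glove_weight(x) * err * err
    return total

def lexicon_gap(W, pairs):
    """Hypothetical helper (an assumption, not the paper's constraint):
    summed squared distance between vectors of lexicon-related word pairs."""
    return sum(
        sum((a - b) ** 2 for a, b in zip(W[i], W[j]))
        for i, j in pairs
    )
```

A constrained formulation would, in general terms, minimize something like `glove_loss` while keeping a quantity like `lexicon_gap` within bounds, whereas post-processing approaches adjust the vectors only after training; how the paper sets this up is not stated in this record.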

References (44)
Stavros J. Perantonis, Nikolaos Ampazis, Vassilis Virvilis. A Learning Framework for Neural Networks Using Constrained Optimization Methods. Annals of Operations Research, vol. 99, pp. 385-401 (2000). DOI: 10.1023/A:1019240304484
Jiang Bian, Bin Gao, Tie-Yan Liu. Knowledge-Powered Deep Learning for Word Embedding. European Conference on Machine Learning, pp. 132-148 (2014). DOI: 10.1007/978-3-662-44848-9_9
Arnold D. Well, Jerome L. Myers. Research Design and Statistical Analysis (1991).
Manaal Faruqui, Jesse Dodge, Sujay Kumar Jauhar, Chris Dyer, Eduard Hovy, Noah A. Smith. Retrofitting Word Vectors to Semantic Lexicons. North American Chapter of the Association for Computational Linguistics, pp. 1606-1615 (2015). DOI: 10.3115/V1/N15-1184
Kevin Duh, Daniel Fried. Incorporating Both Distributional and Relational Semantics in Word Representations. arXiv: Computation and Language (2014).
A. E. Bryson, W. F. Denham. A Steepest-Ascent Method for Solving Optimum Programming Problems. Journal of Applied Mechanics, vol. 29, pp. 247-257 (1962). DOI: 10.1115/1.3640537
Jiliang Tang, Xia Hu, Huan Liu. Social recommendation: a review. Social Network Analysis and Mining, vol. 3, pp. 1113-1133 (2013). DOI: 10.1007/S13278-013-0141-9
Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, Eytan Ruppin. Placing search in context. Proceedings of the tenth international conference on World Wide Web - WWW '01, pp. 406-414 (2001). DOI: 10.1145/371920.372094
Herbert Rubenstein, John B. Goodenough. Contextual correlates of synonymy. Communications of the ACM, vol. 8, pp. 627-633 (1965). DOI: 10.1145/365628.365657
George A. Miller. WordNet. Communications of the ACM, vol. 38, pp. 39-41 (1995). DOI: 10.1145/219717.219748