Discrete-State Variational Autoencoders for Joint Discovery and Factorization of Relations

作者: Diego Marcheggiani , Ivan Titov

DOI: 10.1162/TACL_A_00095

关键词: Hierarchical clusteringPreference (economics)Independence (probability theory)Relation (database)Theoretical computer scienceGenerative grammarFactorizationState (functional analysis)Machine learningContrast (statistics)Artificial intelligenceComputer science

摘要: We present a method for unsupervised open-domain relation discovery. In contrast to previous (mostly generative and agglomerative clustering) approaches, our model relies on rich contextual features makes minimal independence assumptions. The is composed of two parts: feature-rich extractor, which predicts semantic between entities, factorization model, reconstructs arguments (i.e., the entities) relying predicted relation. components are estimated jointly so as minimize errors in recovering arguments. study models inspired by work selectional preference modeling. Our substantially outperform agglomerative-clustering counterparts achieve state-of-the-art performance.

参考文章(61)
Lise Getoor, Ben Taskar, Introduction to statistical relational learning MIT Press. ,(2007)
Tommi S. Jaakkola, Michael I. Jordan, Computing upper and lower bounds on likelihoods in intractable networks uncertainty in artificial intelligence. pp. 340- 348 ,(1996)
Sebastian Riedel, Limin Yao, Andrew McCallum, Modeling relations and their mentions without labeled text european conference on machine learning. pp. 148- 163 ,(2010) , 10.1007/978-3-642-15939-8_10
Tomas Mikolov, Greg S. Corrado, Kai Chen, Jeffrey Dean, Efficient Estimation of Word Representations in Vector Space international conference on learning representations. ,(2013)
Alberto García-Durán, Antoine Bordes, Nicolas Usunier, Effective blending of two and three-way interactions for modeling multi-relational data european conference on machine learning. pp. 434- 449 ,(2014) , 10.1007/978-3-662-44848-9_28
Nicolas Usunier, Jason Weston, Antoine Bordes, Oksana Yakhnenko, Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction empirical methods in natural language processing. pp. 1366- 1371 ,(2013)
Benjamin M. Marlin, Sebastian Riedel, Andrew McCallum, Limin Yao, Relation Extraction with Matrix Factorization and Universal Schemas north american chapter of the association for computational linguistics. pp. 74- 84 ,(2013)
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Taylor Berg-Kirkpatrick, Alexandre Bouchard-Côté, John DeNero, Dan Klein, Painless Unsupervised Learning with Features north american chapter of the association for computational linguistics. pp. 582- 590 ,(2010)
Ledyard R Tucker, Some mathematical notes on three-mode factor analysis Psychometrika. ,vol. 31, pp. 279- 311 ,(1966) , 10.1007/BF02289464