Creating Semantic Representations

Authors: Finn Årup Nielsen, Lars Kai Hansen

DOI: 10.1007/978-3-030-37250-7_2

Keywords: Word embedding, Latent semantic analysis, Natural language processing, Artificial intelligence, Feature hashing, Explicit semantic analysis, Representation (arts), Random indexing, Vector space model, Semantic network, Computer science

Abstract: In this chapter, we present the vector space model and some ways to further process such a representation: With feature hashing, random indexing, latent semantic analysis, non-negative matrix factorization, explicit semantic analysis and word embedding, a word or a text may be associated with a distributed semantic representation. Deep learning, explicit semantic networks and auxiliary non-linguistic information provide further means for creating distributed representations from linguistic data. We point to a few of the methods and datasets used to evaluate the many different algorithms that create a semantic representation, and we also point to some of the problems associated with distributed representations.
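Two of the techniques named in the abstract, the vector space model and feature hashing, can be illustrated with a minimal sketch. It is not taken from the chapter: the toy documents, the whitespace tokenizer, the bucket count, and all function names are assumptions chosen for illustration only.

```python
import hashlib
import math
from collections import Counter


def tokenize(text):
    # Naive lowercase whitespace tokenizer (an assumption, not the chapter's method).
    return text.lower().split()


def bag_of_words(text, vocabulary):
    # Vector space model: one dimension per vocabulary term, holding its count.
    counts = Counter(tokenize(text))
    return [counts[term] for term in vocabulary]


def hashed_bag_of_words(text, n_buckets=8):
    # Feature hashing: map each token to a bucket with a stable hash,
    # so no explicit vocabulary is needed. Collisions are possible.
    vector = [0] * n_buckets
    for token in tokenize(text):
        digest = hashlib.md5(token.encode("utf-8")).digest()
        index = int.from_bytes(digest[:4], "big") % n_buckets
        vector[index] += 1
    return vector


def cosine(u, v):
    # Cosine similarity between two count vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


docs = ["the cat sat on the mat", "the dog sat on the log"]
vocabulary = sorted({token for doc in docs for token in tokenize(doc)})

vsm = [bag_of_words(doc, vocabulary) for doc in docs]
hashed = [hashed_bag_of_words(doc) for doc in docs]

print(vocabulary)
print(vsm)
print(cosine(vsm[0], vsm[1]))
print(hashed)
```

The hashed vectors have a fixed length regardless of vocabulary size, at the cost of occasional collisions between unrelated tokens.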

References (54)
Rami Al-Rfou, Bryan Perozzi, Steven Skiena. Polyglot: Distributed Word Representations for Multilingual NLP. arXiv: Computation and Language (2013).
Mark Dredze, Kuzman Ganchev. Small Statistical Models by Random Feature Mixing. Meeting of the Association for Computational Linguistics, pp. 19-20 (2008).
Omer Levy, Yoav Goldberg, Ido Dagan. Improving Distributional Similarity with Lessons Learned from Word Embeddings. Transactions of the Association for Computational Linguistics, vol. 3, pp. 211-225 (2015). DOI: 10.1162/TACL_A_00134
Daniel D. Lee, H. Sebastian Seung. Learning the parts of objects by non-negative matrix factorization. Nature, vol. 401, pp. 788-791 (1999). DOI: 10.1038/44565
Kevin Lund, Curt Burgess. Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instruments, & Computers, vol. 28, pp. 203-208 (1996). DOI: 10.3758/BF03204766
Finn Årup Nielsen, Lars Kai Hansen. Modeling of activation data in the BrainMap database: detection of outliers. Human Brain Mapping, vol. 15, pp. 146-156 (2002). DOI: 10.1002/HBM.10012
Finn Årup Nielsen, Daniela Balslev, Lars Kai Hansen. Mining the posterior cingulate: segregation between memory and pain components. NeuroImage, vol. 27, pp. 520-532 (2005). DOI: 10.1016/J.NEUROIMAGE.2005.04.034
Philipp Koehn, Kevin Knight. Empirical methods for compound splitting. Conference of the European Chapter of the Association for Computational Linguistics, pp. 187-193 (2003). DOI: 10.3115/1067807.1067833