作者: Michal Nykl , Tomáš Brychcín , Michal Konkol , Tomáš Hercig
DOI:
关键词: Error analysis 、 Visualization 、 Word (computer architecture) 、 Set (abstract data type) 、 Semantic space 、 Natural language processing 、 Computer science 、 Position (vector) 、 Artificial intelligence
摘要: Word embeddings are commonly compared either with human-annotated word similarities or through improvements in natural language processing tasks. We propose a novel principle which compares the information from reality. implement this by comparing geographical positions of cities. Our evaluation linearly transforms semantic space to optimally fit real cities and measures deviation between position given position. A set well-known state-of-the-art results were evaluated. also introduce visualization that helps error analysis.