A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching

作者: Mariona Coll Ardanuy , Kasra Hosseini , Katherine McDonough , Amrey Krause , Daniel van Strien

DOI: 10.1145/3397536.3422236

关键词:

摘要: Recognizing toponyms and resolving them to their real-world referents is required provide advanced semantic access textual data. This process often hindered by the high degree of variation in toponyms. Candidate selection task identifying potential entities that can be referred a previously recognized toponym. While it has traditionally received little attention, candidate significant impact on downstream tasks (i.e. entity resolution), especially noisy or non-standard text. In this paper, we introduce deep learning method for through toponym matching, using state-of-the-art neural network architectures. We perform an intrinsic matching evaluation based several datasets, which cover various challenging scenarios (cross-lingual regional variations, as well OCR errors) assess its performance context geographical English Spanish.

参考文章(16)
Saad Aloteibi, Mark Sanderson, Analyzing geographic query reformulation: An exploratory study Journal of the Association for Information Science and Technology. ,vol. 65, pp. 13- 24 ,(2014) , 10.1002/ASI.22961
Anthony Morana, Thomas Morel, Bilal Berjawi, Fabien Duchateau, GeoBench: a geospatial integration tool for building a spatial entity matching benchmark advances in geographic information systems. pp. 533- 536 ,(2014) , 10.1145/2666310.2666362
Zhiyuan Cheng, James Caverlee, Kyumin Lee, You are where you tweet: a content-based approach to geo-locating twitter users conference on information and knowledge management. pp. 759- 768 ,(2010) , 10.1145/1871437.1871535
Michael D. Lieberman, Gianluca Quercini, Hanan Samet, Jagan Sankaranarayanan, Determining the spatial reader scopes of news sources using local lexicons advances in geographic information systems. pp. 43- 52 ,(2010) , 10.1145/1869790.1869800
Ian Gregory, Christopher Donaldson, Patricia Murrieta-Flores, Paul Rayson, Geoparsing, GIS, and Textual Analysis: Current Developments in Spatial Humanities Research International Journal of Humanities and Arts Computing. ,vol. 9, pp. 1- 14 ,(2015) , 10.3366/IJHAC.2015.0135
Qingqing Gan, Josh Attenberg, Alexander Markowetz, Torsten Suel, Analysis of geographic queries in a search engine log Proceedings of the first international workshop on Location and the web - LOCWEB '08. pp. 49- 56 ,(2008) , 10.1145/1367798.1367806
Ben Hachey, Will Radford, Joel Nothman, Matthew Honnibal, James R. Curran, Evaluating Entity Linking with Wikipedia Artificial Intelligence. ,vol. 194, pp. 130- 150 ,(2013) , 10.1016/J.ARTINT.2012.04.005
Jason Baldridge, Grant DeLozier, Loretta London, Gazetteer-independent toponym resolution using geographic word profiles national conference on artificial intelligence. pp. 2382- 2388 ,(2015)
Max M. Louwerse, Gabriel Recchia, A Comparison of String Similarity Measures for Toponym Matching COMP@SIGSPATIAL. pp. 54- 61 ,(2013)
James O. Butler, Christopher E. Donaldson, Joanna E. Taylor, Ian N. Gregory, Alts, Abbreviations, and AKAs: Historical Onomastic Variation and Automated Named Entity Recognition Journal of Map and Geography Libraries. ,vol. 13, pp. 58- 81 ,(2017) , 10.1080/15420353.2017.1307304