Toward Geographic Information Harvesting: Extraction of Spatial Relational Facts from Web Documents

Authors: Corrado Loglisci, Dino Ienco, Mathieu Roche, Maguelonne Teisseire, Donato Malerba

DOI: 10.1109/ICDMW.2012.20

Abstract: This paper addresses the problem of harvesting geographic information from Web documents, specifically, extracting facts on spatial relations among geographic places. The motivation is twofold. First, researchers on Spatial Data Mining often assume that spatial data are already available, thanks to current GIS and positioning technologies. Nevertheless, this is not applicable to the case of spatial information embedded in data without an explicit spatial modeling, such as documents. Second, despite the huge amount of Web documents conveying useful geographic information, there is not much work on how to harvest spatial data from these documents. The problem is particularly challenging because of the lack of annotated documents, which prevents the application of supervised learning techniques. In this paper, we propose to harvest facts on geographic places through an unsupervised approach which recognizes spatial relations among geographic places without supposing the availability of annotated documents. The proposed approach is based on the combined use of a spatial ontology and a prototype-based classifier. A case study on topological and directional relations is reported and commented.
