GSP (Geo-Semantic-Parsing): Geoparsing and Geotagging with Machine Learning on Top of Linked Data

作者: Marco Avvenuti , Stefano Cresci , Leonardo Nizzoli , Maurizio Tesconi

DOI: 10.1007/978-3-319-93417-4_2

关键词: ParsingContent (measure theory)Benchmark (computing)Artificial intelligenceExploitGeotaggingGeospatial analysisLinked dataMachine learningComputer scienceGeoparsing

摘要: Recently, user-generated content in social media opened up new alluring possibilities for understanding the geospatial aspects of many real-world phenomena. Yet, vast majority such lacks explicit, structured geographic information. Here, we describe design and implementation a novel approach associating information to text documents. GSP exploits powerful machine learning algorithms on top rich, interconnected Linked Data order overcome limitations previous state-of-the-art approaches. In detail, our technique performs semantic annotation identify relevant tokens input document, traverses sub-graph extracting possible related identified optimizes its results by means Support Vector Machine classifier. We compare with those 4 techniques baselines ground-truth data from 2 evaluation datasets. Our achieves excellent performances, best \(F1 = 0.91\), sensibly outperforming benchmark that achieve \le 0.78\).

参考文章(23)
Laurens Rietveld, Rinke Hoekstra, Stefan Schlobach, Christophe Guéret, Structural Properties as Proxy for Semantic Relevance in RDF Graph Sampling The Semantic Web – ISWC 2014. ,vol. 8797, pp. 81- 96 ,(2014) , 10.1007/978-3-319-11915-1_6
Shane Bergsma, Mark Dredze, Michael J. Paul, Hieu Tran, Carmen: A Twitter Geolocation System with Applications to Public Health ,(2013)
Ricardo Usbeck, Axel-Cyrille Ngonga Ngomo, Michael Röder, Daniel Gerber, Sandro Athaide Coelho, Sören Auer, Andreas Both, None, AGDISTIS - Graph-Based Disambiguation of Named Entities Using Linked Data The Semantic Web – ISWC 2014. pp. 457- 471 ,(2014) , 10.1007/978-3-319-11964-9_29
Li Ding, Joshua Shinavier, Zhenning Shangguan, Deborah L. McGuinness, SameAs networks and beyond: analyzing deployment status and implications of owl:sameAs in linked data international semantic web conference. pp. 145- 160 ,(2010) , 10.1007/978-3-642-17746-0_10
Gerald Töpper, Magnus Knuth, Harald Sack, DBpedia ontology enrichment for inconsistency detection Proceedings of the 8th International Conference on Semantic Systems - I-SEMANTICS '12. pp. 33- 40 ,(2012) , 10.1145/2362499.2362505
Judith Gelernter, Shilpa Balaji, An algorithm for local geoparsing of microtext Geoinformatica. ,vol. 17, pp. 635- 667 ,(2013) , 10.1007/S10707-012-0173-8
Zhiyuan Cheng, James Caverlee, Kyumin Lee, You are where you tweet: a content-based approach to geo-locating twitter users conference on information and knowledge management. pp. 759- 768 ,(2010) , 10.1145/1871437.1871535
Stuart E. Middleton, Lee Middleton, Stefano Modafferi, Real-Time Crisis Mapping of Natural Disasters Using Social Media IEEE Intelligent Systems. ,vol. 29, pp. 9- 17 ,(2014) , 10.1109/MIS.2013.126
Heiko Paulheim, Johannes Fümkranz, Unsupervised generation of data mining features from linked open data web intelligence, mining and semantics. pp. 31- ,(2012) , 10.1145/2254129.2254168
Pablo N. Mendes, Max Jakob, Andrés García-Silva, Christian Bizer, DBpedia spotlight Proceedings of the 7th International Conference on Semantic Systems - I-Semantics '11. pp. 1- 8 ,(2011) , 10.1145/2063518.2063519