Avoiding Chinese Whispers: Controlling End-to-End Join Quality in Linked Open Data Stores

Authors: Jan-Christoph Kalo, Silviu Homoceanu, Jewgeni Rose, Wolf-Tilo Balke

DOI: 10.1145/2786451.2786466

Abstract: Today Linked Open Data is a central trend in information provisioning. Data is collected in distributed data stores, individually curated with high quality, and made available over the Web for a wide variety of Web applications providing their own business logic for data utilization. Thus, the key promise of Linked Open Data is to provide a holistic view of data about a wide range of items or entities. But, parallel to the problems of database integration and schema matching, the linking of several data sources remains a challenge, currently severely hampering the vision of a working Semantic Web. One possible solution is to use instance matching systems that automatically create owl:sameAs links between data stores. According to existing benchmarks, the matching quality has even reached a satisfying level. However, our extensive analysis shows that instance matching systems are not yet ready for large-scale interlinking. This is because query processors joining via a single incorrectly created link implicitly also use all transitive links, which may turn out to be mismatched again. The result is similar to the game of Chinese Whispers: watered-down sameAs semantics step-by-step lead to terrible end-to-end quality of joins. We develop innovative structural mechanisms on top of instance matching systems to significantly improve query processing by avoiding Chinese Whispers.
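The transitive effect described in the abstract can be illustrated with a small sketch. This is not the authors' mechanism; it only shows, under assumed toy data, how treating owl:sameAs as a transitive equivalence lets one incorrect link merge two otherwise separate entity clusters. The identifiers (`a:Paris_France`, `b:Paris_TX`, and so on) and the function name `same_as_clusters` are hypothetical, chosen for illustration.

```python
from collections import defaultdict


def same_as_clusters(links):
    """Group identifiers into equivalence classes under transitive sameAs.

    Uses a plain union-find structure; this is an illustrative stand-in,
    not the structural mechanism proposed in the paper.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    clusters = defaultdict(set)
    for node in list(parent):
        clusters[find(node)].add(node)
    return sorted((sorted(c) for c in clusters.values()), key=lambda c: c[0])


# Hypothetical sameAs links across three stores (prefixes a:, b:, c:).
correct_links = [
    ("a:Paris_France", "b:Paris_FR"),
    ("b:Paris_FR", "c:Paris"),
    ("a:Paris_Texas", "b:Paris_TX"),
]
# A single mismatched link, as an instance matcher might produce.
wrong_link = ("c:Paris", "b:Paris_TX")

clean = same_as_clusters(correct_links)
polluted = same_as_clusters(correct_links + [wrong_link])
print(len(clean), len(polluted))
```

With the correct links alone, the two cities stay in separate clusters. Adding the one wrong link collapses them into a single cluster, so a join that follows sameAs transitively would mix facts about both entities.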
