LINDA

Authors: Christoph Böhm, Gerard de Melo, Felix Naumann, Gerhard Weikum

DOI: 10.1145/2396761.2398582

Abstract: Linked Data has emerged as a powerful way of interconnecting structured data on the Web. However, the cross-linkage between Linked Data sources is not as extensive as one would hope for. In this paper, we formalize the task of automatically creating "sameAs" links across data sources in a globally consistent manner. Our algorithm, presented in a multi-core as well as a distributed version, achieves this link generation by accounting for joint evidence of a match. Experiments confirm that our system scales beyond 100 million entities and delivers highly accurate results despite the vast heterogeneity and daunting scale.

References (16)
Khashayar Rohanimanesh, Michael L. Wick, Andrew McCallum, Aron Culotta. An Entity Based Model for Coreference Resolution. SIAM International Conference on Data Mining, pp. 365-376 (2009).
Julius Volz, Christian Bizer, Martin Gaedke, Georgi Kobilarov. Discovering and Maintaining Links on the Web of Data. International Semantic Web Conference, vol. 5823, pp. 650-665 (2009). DOI: 10.1007/978-3-642-04930-9_41
Fabian M. Suchanek, Serge Abiteboul, Pierre Senellart. PARIS. Proceedings of the VLDB Endowment, vol. 5, pp. 157-168 (2011). DOI: 10.14778/2078331.2078332
Melanie Herschel, Felix Naumann, Sascha Szott, Maik Taubert. Scalable Iterative Graph Duplicate Detection. IEEE Transactions on Knowledge and Data Engineering, vol. 24, pp. 2094-2108 (2012). DOI: 10.1109/TKDE.2011.99
Vibhor Rastogi, Nilesh Dalvi, Minos Garofalakis. Large-scale collective entity matching. Proceedings of the VLDB Endowment, vol. 4, pp. 208-218 (2011). DOI: 10.14778/1938545.1938546
Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Renée J. Miller, Min Wang. A framework for semantic link discovery over relational data. Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM '09), pp. 1027-1036 (2009). DOI: 10.1145/1645953.1646084
Aidan Hogan, Antoine Zimmermann, Jürgen Umbrich, Axel Polleres, Stefan Decker. Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corpora. Journal of Web Semantics, vol. 10, pp. 76-110 (2012). DOI: 10.1016/J.WEBSEM.2011.11.002
Steven Euijong Whang, Hector Garcia-Molina. Joint Entity Resolution. 2012 IEEE 28th International Conference on Data Engineering, pp. 294-305 (2012). DOI: 10.1109/ICDE.2012.119
Lars Kolb, Andreas Thor, Erhard Rahm. Block-based load balancing for entity resolution with MapReduce. Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM '11), pp. 2397-2400 (2011). DOI: 10.1145/2063576.2063976
Hanna Köpcke, Erhard Rahm. Frameworks for entity matching: A comparison. Data and Knowledge Engineering, vol. 69, pp. 197-210 (2010). DOI: 10.1016/J.DATAK.2009.10.003