A self-training approach for resolving object coreference on the semantic web

作者: Wei Hu , Jianfeng Chen , Yuzhong Qu

DOI: 10.1145/1963405.1963421

关键词: Information retrievalCoreferenceComputer scienceEquivalence (formal languages)Discriminative modelPrecision and recallInferenceSemantic WebPragmatics

摘要: An object on the Semantic Web is likely to be denoted with multiple URIs by different parties. Object coreference resolution identify "equivalent" that denote same object. Driven Linking Open Data (LOD) initiative, millions of have been explicitly linked owl:sameAs statements, but potentially coreferent ones are still considerable. Existing approaches address problem mainly from two directions: one based upon equivalence inference mandated OWL semantics, which finds semantically probably omits many potential ones; other via similarity computation between property-value pairs, not always accurate enough. In this paper, we propose a self-training approach for Web, leverages classes bridge gap and candidates. For an URI, firstly establish kernel consists owl:sameAs, (inverse) functional properties (max-)cardinalities, then extend such iteratively in terms discriminative pairs descriptions URIs. particular, discriminability learnt statistical measurement, only exploits key characteristics representing object, also takes into account matchability pragmatics. addition, frequent property combinations mined improve accuracy resolution. We implement scalable system demonstrate our achieves good precision recall resolving coreference, both benchmark large-scale datasets.

参考文章(28)
Stefano Montanelli, Davide Lorusso, Alfio Ferrara, Automatic Identity Recognition in The Semantic Web. IRSW. ,(2008)
Nathalie Pernelle, Fatiha Saïs, Marie-Christine Rousset, L2R: a logical method for reference reconciliation national conference on artificial intelligence. pp. 329- 334 ,(2007)
Antoine Isaac, Lourens van der Meij, Stefan Schlobach, Shenghui Wang, An empirical study of instance-based ontology matching international semantic web conference. ,vol. 4825, pp. 253- 266 ,(2007) , 10.1007/978-3-540-76298-0_19
Jan Noessner, Mathias Niepert, Christian Meilicke, Heiner Stuckenschmidt, Leveraging Terminological Structure for Object Reconciliation Lecture Notes in Computer Science. pp. 334- 348 ,(2010) , 10.1007/978-3-642-13489-0_23
Jacopo Urbani, Spyros Kotoulas, Jason Maassen, Frank van Harmelen, Henri Bal, OWL Reasoning with WebPIE: Calculating the Closure of 100 Billion Triples Lecture Notes in Computer Science. pp. 213- 227 ,(2010) , 10.1007/978-3-642-13486-9_15
Shenghui Wang, Gwenn Englebienne, Stefan Schlobach, Learning Concept Mappings from Instance Similarity international semantic web conference. ,vol. 5318, pp. 339- 355 ,(2008) , 10.1007/978-3-540-88564-1_22
Andriy Nikolov, Victoria Uren, Enrico Motta, Anne de Roeck, Refining Instance Coreferencing Results Using Belief Propagation asian semantic web conference. pp. 405- 419 ,(2008) , 10.1007/978-3-540-89704-0_28
Li Ding, Joshua Shinavier, Zhenning Shangguan, Deborah L. McGuinness, SameAs networks and beyond: analyzing deployment status and implications of owl:sameAs in linked data international semantic web conference. pp. 145- 160 ,(2010) , 10.1007/978-3-642-17746-0_10
Harry Halpin, Patrick J. Hayes, James P. McCusker, Deborah L. McGuinness, Henry S. Thompson, When owl: sameAs isn't the same: an analysis of identity in linked data international semantic web conference. pp. 305- 320 ,(2010) , 10.1007/978-3-642-17746-0_20