作者: Lifang Gu , Deanne Vickers , Chris Rainsford , Rohan Baxter
DOI:
关键词:
摘要: Record linkage is the task of quickly and accurately identifying records corresponding to same entity from one or more data sources. also known as cleaning, reconciliation identification merge/purge problem. This paper presents “standard” probabilistic record model associated algorithm. Recent work in information retrieval, federated database systems mining have proposed alternatives key components standard The impact these on approach are assessed. question whether how new better terms time, accuracy degree automation for a particular application.