作者: Murtaza Muidul Huda Chowdhury , Satish J. Thomas
DOI:
关键词:
摘要: A pair of records is tokenized to form a normalized representation an entity represented by each record. The tokens are correlated machine learning system determining whether learned resolution already exists for the two entities. If not, compared generate comparison measure determine match. can also be used perform web search and results as additional matching. When match found, updated indicate that they match, provided update resolutions.