作者: Richard Glenn Morris , Xinchuan Zeng , David Randal Elkington
DOI:
关键词:
摘要: According to various embodiments of the present invention, an automated technique is implemented for resolving and merging fields accurately reliably, given a set duplicated records that represents same entity. In at least one embodiment, system uses machine learning (ML) method, train model from training data, learn users how efficiently resolve merge fields. method invention builds feature vectors as input its ML method. apply Hierarchical Based Sequencing (HBS) and/or Multiple Output Relaxation (MOR) models in Training data can come any suitable source or combination sources.