作者: Stella Margonar
DOI:
关键词:
摘要: The ability of identifying whether two strings represent names referring to the same real world entity is essential for avoiding information integration problems, such as duplication records. We study this problem in a scenario where amount data analyze becomes large. Our purpose develop framework that address name match and search problem, combining together different strategies, able consider also semantic string representing name. Moreover we propose dataset evaluating matching algorithm which variation names.