Item name normalization

Author: Arkady Borkovsky

DOI:

Keywords:

Abstract: A computer-implemented approach for processing search queries generally involves normalizing names and descriptions of items. Each of the various forms of a name or description of an item is referred to as a variant, and each variant corresponds to a normalized form of the name. Item variants that are similar are grouped together into clusters. Each cluster is mapped to a normalized form. A dictionary is created by storing: 1) the variant, 2) information obtained from the source with which the variant is associated, and 3) a mapping that maps the variant to the corresponding normalized form.
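The steps named in the abstract (group similar variants into clusters, map each cluster to a normalized form, store a dictionary entry per variant) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the method claimed in the patent: the similarity test (`cluster_key`, which compares lowercased, sorted tokens), the rule for picking a normalized form (most frequent variant in the cluster), and all names such as `DictionaryEntry`, `build_dictionary`, and `normalize_query` are hypothetical.

```python
import re
from collections import Counter, defaultdict
from dataclasses import dataclass, field


@dataclass
class DictionaryEntry:
    variant: str                 # 1) the variant as it appeared
    normalized_form: str         # 3) the form the variant maps to
    sources: list = field(default_factory=list)  # 2) source information


def cluster_key(variant: str) -> str:
    """Hypothetical similarity test: variants with the same set of
    lowercased alphanumeric tokens fall into the same cluster."""
    tokens = re.findall(r"[a-z0-9]+", variant.lower())
    return " ".join(sorted(tokens))


def build_dictionary(observations):
    """Build a variant dictionary from (variant, source_info) pairs."""
    # Group similar variants into clusters.
    clusters = defaultdict(list)
    for variant, source_info in observations:
        clusters[cluster_key(variant)].append((variant, source_info))

    dictionary = {}
    for members in clusters.values():
        # Assumption: the most frequent variant serves as the cluster's
        # normalized form; ties go to the variant seen first.
        counts = Counter(variant for variant, _ in members)
        normalized = max(counts, key=counts.get)
        for variant, source_info in members:
            entry = dictionary.setdefault(
                variant, DictionaryEntry(variant, normalized)
            )
            entry.sources.append(source_info)
    return dictionary


def normalize_query(query: str, dictionary) -> str:
    """Replace a query that matches a known variant with its normalized
    form; unknown queries pass through unchanged."""
    entry = dictionary.get(query)
    return entry.normalized_form if entry else query
```

With this sketch, a query that exactly matches a stored variant is rewritten to the normalized form of its cluster before the search runs. A real system would need a more tolerant similarity measure and lookup than exact string matching.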

References (9)
Ronald Everett Dann, Name resolution in a directory database (1990)
Christopher C. Dozier, Paul Thompson, Name Searching and Information Retrieval, Empirical Methods in Natural Language Processing (1997)
Yael Ravin, Roy Jefferson Byrd, Faye Nina Wacholder, Misook A. Choi, Using canonical forms to develop a dictionary of names in a text (1996)
George Varghese, Hugh M. Wilkinson, Nigel T. Poole, Compressed prefix matching database searching (1990)
Jean-Pierre Chanod, Gregory Grefenstette, Eric Gaussier, Grouping words with equivalent substrings by automatic clustering based on suffix relationships (1999)
Christina Carrick, Carolyn Watters, Automatic association of news items, Information Processing and Management, vol. 33, pp. 615-632 (1997), 10.1016/S0306-4573(97)00021-6
Nina Wacholder, Yael Ravin, Misook Choi, Disambiguation of Proper Names in Text, Conference on Applied Natural Language Processing, pp. 202-208 (1997), 10.3115/974557.974587