Interactive Entity Resolution in Relational Data: A Visual Analytic Tool and Its Evaluation

作者: Hyunmo Kang , L. Getoor , B. Shneiderman , M. Bilgic , L. Licamele

DOI: 10.1109/TVCG.2008.55

关键词:

摘要: Databases often contain uncertain and imprecise references to real-world entities. Entity resolution, the process of reconciling multiple underlying entities, is an important data cleaning required before accurate visualization or analysis possible. In many cases, in addition noisy describing there relationships among This relational during entity resolution process; it useful both for algorithms which determine likely database be resolved visual analytic tools support process. this paper, we introduce a novel user interface, D-Dupe, interactive data. D-Dupe effectively combines with network that enables users make use entity's context making decisions. Since decisions are interdependent, facilitates understanding complex through animations highlight combined inferences history mechanism allows inspect chains An empirical study 12 confirmed benefits on performance tasks terms time as well users' confidence satisfaction.

参考文章(38)
Joshua O’Madadhain, Danyel Fisher, Padhraic Smyth, Yan-Biao Boey, Analysis and Visualization of Network Data using JUNG ,(2005)
Dmitri V. Kalashnikov, Sharad Mehrotra, Zhaoqi Chen, Exploiting relationships for domain-independent data cleaning † siam international conference on data mining. pp. 262- 273 ,(2005)
Linton C. Freeman, Visualizing Social Networks. Journal of Social Structure. ,vol. 1, ,(2000)
Tamraparni Dasu, Theodore Johnson, Exploratory Data Mining and Data Cleaning ,(2003)
Rohit Ananthakrishna, Surajit Chaudhuri, Venkatesh Ganti, Eliminating fuzzy duplicates in data warehouses very large data bases. pp. 586- 597 ,(2002) , 10.1016/B978-155860869-6/50058-5
Indrajit Bhattacharya, Lise Getoor, Entity Resolution in Graphs John Wiley & Sons, Inc.. pp. 311- 344 ,(2005) , 10.1002/9780470073049.CH13
Stephen E. Fienberg, William W. Cohen, Pradeep Ravikumar, A comparison of string distance metrics for name-matching tasks international joint conference on artificial intelligence. pp. 73- 78 ,(2003)
Michael Baur, Marc Benkert, Ulrik Brandes, Sabine Cornelsen, Marco Gaertler, Boris Köpf, Jürgen Lerner, Dorothea Wagner, Visone Software for Visual Social Network Analysis graph drawing. pp. 463- 464 ,(2001) , 10.1007/3-540-45848-4_47
Glenn E. Krasner, Stephen T. Pope, A cookbook for using the model-view controller user interface paradigm in Smalltalk-80 Journal of Object-oriented Programming. ,vol. 1, pp. 26- 49 ,(1988) , 10.5555/50757.50759