Secure Blocking + Secure Matching = Secure Record Linkage

作者: Alexandros Karakasidis , Vassilios S. Verykios

DOI: 10.5626/JCSE.2011.5.3.223

关键词:

摘要: Performing approximate data matching has always been an intriguing problem for both industry and academia. This task becomes even more challenging when the requirement of privacy rises. In this paper, we propose a novel technique to address efficient privacy-preserving record linkage. The secure framework consists two basic components. First, utilize blocking component based on phonetic algorithms statistically enhanced improve security. Second, use where actual is performed using private approach Levenshtein Distance algorithm. Our goal combine speed with increased accuracy matching. Category: Ubiquitous computing; Security

参考文章(39)
Panagiotis G. Ipeirotis, Nick Koudas, H. V. Jagadish, Divesh Srivastava, Luis Gravano, Lauri Pietarinen, S. Muthukrishnan, Using q-grams in a DBMS for Approximate String Processing. IEEE Data(base) Engineering Bulletin. ,vol. 24, pp. 28- 34 ,(2001)
Oded Goldreich, Foundations of Cryptography Cambridge University Press. ,(2001) , 10.1017/CBO9780511546891
Rob Hall, Stephen E. Fienberg, Privacy-preserving record linkage privacy in statistical databases. pp. 269- 283 ,(2010) , 10.1007/978-3-642-15838-4_24
Tim Churches, Peter Christen, Blind Data Linkage Using n-gram Similarity Comparisons Advances in Knowledge Discovery and Data Mining. pp. 121- 126 ,(2004) , 10.1007/978-3-540-24775-3_15
Mauricio A. Hernández, Salvatore J. Stolfo, Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem Data Mining and Knowledge Discovery. ,vol. 2, pp. 9- 37 ,(1998) , 10.1023/A:1009761603038
Stephen E. Fienberg, William W. Cohen, Pradeep Ravikumar, A comparison of string distance metrics for name-matching tasks international joint conference on artificial intelligence. pp. 73- 78 ,(2003)
V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals Soviet physics. Doklady. ,vol. 10, pp. 707- 710 ,(1966)