A Quantitative Analysis of the Performance and Scalability of De-identification Tools for Medical Data

作者: Zhiming Liu , Nafees Qamar , Jie Qian

DOI: 10.1007/978-3-642-53956-5_18

关键词:

摘要: Recent developments in data de-identification technologies offer sophisticated solutions to protect medical when, especially the is be provided for secondary purposes such as clinical or biomedical research. So determine what degree an approach--- along with its tool--- usable and effective, this paper takes into consideration a number of tools that aim at reducing re-identification risk published data, yet preserving statistical meanings. We therefore evaluate residual by conducting experimental evaluation most stable research-based tools, applied our Electronic Health Records EHRs database, assess which tool exhibits better performance different quasi-identifiers. Our criteria are quantitative opposed other descriptive qualitative assessments. notice on comparing individual disclosure information loss each μ-Argus performs better. Also, generalization method considerably than suppression terms avoiding loss. also find sdcMicro has best scalability among counterparts, been observed experimentally virtual consisted 33 variables 10,000 records.

参考文章(19)
Nafees Qamar, Johannes Faber, Yves Ledru, Zhiming Liu, Automated Reviewing of Healthcare Security Policies FHIES 2012 - 2nd International Symposium on the Foundations of Health Information Engineering and Systems. ,vol. 7789, pp. 176- 193 ,(2012) , 10.1007/978-3-642-39088-3_12
Latanya Sweeney, Simple Demographics Often Identify People Uniquely Carnegie Mellon University. ,(2000) , 10.1184/R1/6625769.V1
Petra Knaup, Evelyn J. S. Hovenga, Jasmin Buck, Sebastian Garde, Ubiquitous information for ubiquitous computing: expressing clinical data sets with openEHR archetypes. medical informatics europe. ,vol. 124, pp. 215- 220 ,(2006)
Hal Abelson, Latanya Arvette Sweeney, Computational disclosure control: a primer on data privacy protection Massachusetts Institute of Technology. ,(2001)
Matthias Templ, Statistical Disclosure Control for Microdata Using the R-Package sdcMicro Transactions on Data Privacy. ,vol. 1, pp. 67- 85 ,(2008)
Pierangela Samarati, Latanya Sweeney, Generalizing data to provide anonymity when disclosing information (abstract) symposium on principles of database systems. pp. 188- ,(1998) , 10.1145/275487.275508
Tiancheng Li, Ninghui Li, Optimal k-Anonymity with Flexible Generalization Schemes through Bottom-up Searching international conference on data mining. pp. 518- 523 ,(2006) , 10.1109/ICDMW.2006.127
Xiaokui Xiao, Guozhang Wang, Johannes Gehrke, Interactive anonymization of sensitive data international conference on management of data. pp. 1051- 1054 ,(2009) , 10.1145/1559845.1559979
Todd Fitzgerald, Building Management Commitment through Security Councils Information Systems Security. ,vol. 35, pp. 1- 14 ,(2007) , 10.1080/07366980701369577