Differentially private data publishing via cross-moment microaggregation

作者: Javier Parra-Arnau , Josep Domingo-Ferrer , Jordi Soria-Comas

DOI: 10.1016/J.INFFUS.2019.06.011

关键词:

摘要: Abstract Differential privacy is one of the most prominent notions in field anonymization. However, its strong guarantees very often come at expense significantly degrading utility protected data. To cope with this, numerous mechanisms have been studied that reduce sensitivity data and hence noise required to satisfy this notion. In paper, we present a generalization classical microaggregation, where aggregated records are replaced by group mean additional statistical measures, purpose evaluating it as reduction mechanism. We propose an anonymization methodology for numerical microdata which target protection set microaggregated generalized way, disclosure risk limitation guaranteed through differential via record-level perturbation. Specifically, describe three algorithms microaggregation can be applied either entire or groups attributes independently. Our theoretical analysis computes sensitivities first two central cross moments; apply fundamental results from matrix perturbation theory derive bounds on eigenvalues eigenvectors covariance coskewness matrices. extensive experimental evaluation shows enhanced medium large sizes groups. For range sizes, find evidence our approach provide not only higher but also than traditional microaggregation.

参考文章(32)
David Sánchez, Josep Domingo-Ferrer, Sergio Martínez, Improving the Utility of Differential Privacy via Univariate Microaggregation privacy in statistical databases. pp. 130- 142 ,(2014) , 10.1007/978-3-319-11257-2_11
Adelchi Azzalini, A class of distributions which includes the normal ones Scandinavian Journal of Statistics. ,vol. 12, pp. 171- 178 ,(1985)
Rathindra Sarathy, Krishnamurty Muralidhar, Evaluating Laplace Noise Addition to Satisfy Differential Privacy for Numeric Data Transactions on Data Privacy. ,vol. 4, pp. 1- 17 ,(2011)
Jaewoo Lee, Chris Clifton, How much is enough? choosing ε for differential privacy international conference on information security. pp. 325- 340 ,(2011) , 10.1007/978-3-642-24861-0_22
Anco Hundepool, Josep Domingo-Ferrer, Luisa Franconi, Sarah Giessing, Eric Schulte Nordholt, Keith Spicer, Peter-Paul De Wolf, Statistical Disclosure Control ,(2012)
Ji-guang Sun, G. W. Stewart, Matrix perturbation theory ,(1990)
Anna Oganian, Josep Domingo-Ferrer, On the complexity of optimal microaggregation for statistical disclosure control Statistical journal of the United Nations economic commission for Europe. ,vol. 18, pp. 345- 354 ,(2001) , 10.3233/SJU-2001-18409
Cynthia Dwork, Krishnaram Kenthapadi, Frank McSherry, Ilya Mironov, Moni Naor, Our Data, Ourselves: Privacy Via Distributed Noise Generation Advances in Cryptology - EUROCRYPT 2006. ,vol. 4004, pp. 486- 503 ,(2006) , 10.1007/11761679_29
Chao Li, Michael Hay, Gerome Miklau, Yue Wang, A data- and workload-aware algorithm for range queries under differential privacy Proceedings of the VLDB Endowment. ,vol. 7, pp. 341- 352 ,(2014) , 10.14778/2732269.2732271