$\textsf{LoPub}$ : High-Dimensional Crowdsourced Data Publication With Local Differential Privacy

作者: Xuebin Ren , Chia-Mu Yu , Weiren Yu , Shusen Yang , Xinyu Yang

DOI: 10.1109/TIFS.2018.2812146

关键词:

摘要: High-dimensional crowdsourced data collected from numerous users produces rich knowledge about our society; however, it also brings unprecedented privacy threats to the participants. Local differential (LDP), a variant of privacy, is recently proposed as state-of-the-art notion. Unfortunately, achieving LDP on high-dimensional publication raises great challenges in terms both computational efficiency and utility. To this end, based expectation maximization (EM) algorithm Lasso regression, we first propose efficient multi-dimensional joint distribution estimation algorithms with LDP. Then, develop local differentially private ( LoPub ) by taking advantage techniques. In particular, correlations among multiple attributes are identified reduce dimensionality data, thus speeding up learning process high Extensive experiments real-world datasets demonstrate that multivariate scheme significantly outperforms existing schemes communication overhead speed. Moreover, can keep, average, 80% 60% accuracy over released support vector machine random forest classification, respectively.

参考文章(35)
Rathindra Sarathy, Krishnamurty Muralidhar, Evaluating Laplace Noise Addition to Satisfy Differential Privacy for Numeric Data Transactions on Data Privacy. ,vol. 4, pp. 1- 17 ,(2011)
Cynthia Dwork, Frank McSherry, Kobbi Nissim, Adam Smith, Calibrating Noise to Sensitivity in Private Data Analysis Theory of Cryptography. ,vol. 3876, pp. 265- 284 ,(2006) , 10.1007/11681878_14
Gergely Ács, Claude Castelluccia, I have a DREAM!: differentially private smart metering information hiding. pp. 118- 132 ,(2011) , 10.1007/978-3-642-24178-9_9
Muhammad Naveed, Erman Ayday, Ellen W. Clayton, Jacques Fellay, Carl A. Gunter, Jean-Pierre Hubaux, Bradley A. Malin, Xiaofeng Wang, Privacy in the Genomic Era ACM Computing Surveys. ,vol. 48, pp. 6- ,(2015) , 10.1145/2767007
Wei-Yen Day, Ninghui Li, Differentially Private Publishing of High-dimensional Data Using Sensitivity Control computer and communications security. pp. 451- 462 ,(2015) , 10.1145/2714576.2714621
SK Srivastava, Randomized response: a survey technique for eliminating evasive answer bias. Journal of the American Statistical Association. ,vol. 60, pp. 63- 69 ,(1965) , 10.1080/01621459.1965.10480775
Tianqing Zhu, Ping Xiong, Gang Li, Wanlei Zhou, None, Correlated Differential Privacy: Hiding Information in Non-IID Data Set IEEE Transactions on Information Forensics and Security. ,vol. 10, pp. 229- 242 ,(2015) , 10.1109/TIFS.2014.2368363
Cynthia Dwork, Aaron Roth, The Algorithmic Foundations of Differential Privacy ,(2014)
John C Duchi, Michael I Jordan, Martin J Wainwright, None, Local Privacy and Statistical Minimax Rates 2013 IEEE 54th Annual Symposium on Foundations of Computer Science. pp. 429- 438 ,(2013) , 10.1109/FOCS.2013.53
Wei Wang, Qian Zhang, Privacy-Preserving Collaborative Spectrum Sensing With Multiple Service Providers IEEE Transactions on Wireless Communications. ,vol. 14, pp. 1011- 1019 ,(2015) , 10.1109/TWC.2014.2363357