Statistical database modeling for privacy preserving database generation

作者: Xintao Wu , Yongge Wang , Yuliang Zheng

DOI: 10.1007/11425274_40

关键词:

摘要: Testing of database applications is great importance. Although various studies have been conducted to investigate testing techniques for design, relatively few efforts made explicitly address the which requires a large amount representative data available. As over live production databases often infeasible in many situations due high risks disclosure confidential information or incorrect updating real data, this paper we problem generating synthetic based on a-priori knowledge about database. Our approach fit general location model using characteristics (e.g., constraints, statistics, rules) extracted from and then generate learnt. may contain be used by attacker derive some information, present analysis method cell suppression technique. effective efficient remove aggregate private during generation.

参考文章(5)
Adrian Dobra, Stephen E. Fienberg, Bounds for cell entries in contingency tables induced by fixed marginal totals with applications to disclosure limitation Statistical journal of the United Nations economic commission for Europe. ,vol. 18, pp. 363- 371 ,(2001) , 10.3233/SJU-2001-18411
John M. Stephens, Meikel Poess, MUDD: a multi-dimensional data generator workshop on software and performance. ,vol. 29, pp. 104- 109 ,(2004) , 10.1145/974043.974060
Xintao Wu, Yongge Wang, Yuliang Zheng, Privacy preserving database application testing Proceeding of the ACM workshop on Privacy in the electronic society - WPES '03. pp. 118- 128 ,(2003) , 10.1145/1005140.1005159
David Chays, Saikat Dan, Phyllis G. Frankl, Filippos I. Vokolos, Elaine J. Weber, A framework for testing database applications international symposium on software testing and analysis. ,vol. 25, pp. 147- 157 ,(2000) , 10.1145/347324.348954