Obfuscating Document Stylometry to Preserve Author Anonymity

Authors: Gary Kacmarcik, Michael Gamon

DOI: 10.3115/1273073.1273131

Abstract: This paper explores techniques for reducing the effectiveness of standard authorship attribution techniques so that an author A can preserve anonymity for a particular document D. We discuss feature selection and adjustment and show how this information can be fed back to the author to create a new document D' for which the calculated attribution moves away from A. Since it can be labor intensive to adjust the document in this fashion, we attempt to quantify the amount of effort required to produce the anonymized document and introduce two levels of anonymization: shallow and deep. In our test set, we show that shallow anonymization can be achieved by making 14 changes per 1000 words to reduce the likelihood of identifying A as the author by an average of more than 83%. For deep anonymization, we adapt the unmasking work of Koppel and Schler to provide feedback that allows the author to choose the level of anonymization.
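The feedback loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function-word list, the use of per-1000-word rates as features, and the helper names `rates_per_1000`, `suggest_changes`, and `changes_per_1000` are all assumptions made for illustration. The sketch compares the document's word rates with a target profile that differs from author A's and reports how many occurrences of each word would need to be added or removed, which gives a rough count of changes per 1000 words.

```python
from collections import Counter
import re

# Hypothetical, tiny function-word list chosen only for illustration.
FUNCTION_WORDS = ["the", "of", "and", "to", "upon", "while", "whilst", "on"]


def rates_per_1000(text, vocab=FUNCTION_WORDS):
    """Return occurrences per 1000 tokens for each word in vocab."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = max(len(tokens), 1)  # avoid division by zero on empty text
    return {w: 1000.0 * counts[w] / total for w in vocab}


def suggest_changes(doc_rates, author_rates, other_rates, n_tokens):
    """Suggest edits that move the document toward other_rates.

    Returns (word, signed_change) pairs sorted by magnitude, where
    signed_change approximates how many occurrences to add (+) or
    remove (-). Words on which the two profiles agree are skipped,
    since changing them would not move the document away from A.
    """
    suggestions = []
    for word, doc_rate in doc_rates.items():
        if author_rates[word] == other_rates[word]:
            continue
        delta_rate = other_rates[word] - doc_rate
        change = round(delta_rate * n_tokens / 1000.0)
        if change != 0:
            suggestions.append((word, change))
    suggestions.sort(key=lambda pair: abs(pair[1]), reverse=True)
    return suggestions


def changes_per_1000(suggestions, n_tokens):
    """Total suggested edits, normalized per 1000 tokens."""
    return 1000.0 * sum(abs(c) for _, c in suggestions) / n_tokens
```

A real system would likely select discriminating features with a trained classifier rather than a fixed word list; the sketch only shows how the effort of anonymizing a document can be expressed as a number of changes per 1000 words.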

References (12)
Josyula R. Rao, Pankaj Rohatgi. Can pseudonymity really guarantee privacy? USENIX Security Symposium, pp. 7-7 (2000).
Max Chickering. The WinMine Toolkit (2017).
B. Schölkopf, C. Burges, A. Smola. Advances in kernel methods: support vector learning. International Conference on Neural Information Processing (1999). DOI: 10.5555/299094
David Maxwell Chickering, David Heckerman, Christopher Meek. A Bayesian approach to learning Bayesian networks with local structure. Uncertainty in Artificial Intelligence, pp. 80-89 (1997).
F. J. Tweedie, S. Singh, D. I. Holmes. Neural network applications in stylometry: The Federalist Papers. Computers and the Humanities, vol. 30, pp. 1-10 (1996). DOI: 10.1007/BF00054024
Robert A. Bosch, Jason A. Smith. Separating Hyperplanes and the Authorship of the Disputed Federalist Papers. American Mathematical Monthly, vol. 105, pp. 601-608 (1998). DOI: 10.1080/00029890.1998.12004933
D. I. Holmes. The Federalist Revisited: New Directions in Authorship Attribution. Literary and Linguistic Computing, vol. 10, pp. 111-127 (1995). DOI: 10.1093/LLC/10.2.111
Moshe Koppel, Jonathan Schler. Authorship verification as a one-class classification problem. Twenty-first International Conference on Machine Learning (ICML '04), pp. 62- (2004). DOI: 10.1145/1015330.1015448
D. I. Holmes. The Evolution of Stylometry in Humanities Scholarship. Literary and Linguistic Computing, vol. 13, pp. 111-117 (1998). DOI: 10.1093/LLC/13.3.111