Feature Selection for Website Fingerprinting

Authors: Junhua Yan, Jasleen Kaur

DOI: 10.1515/POPETS-2018-0039

Abstract: Website fingerprinting based on TCP/IP headers is of significant relevance to several Internet entities. Prior work has focused on only a limited set of features and does not help in understanding the extent of fingerprintability. We address this by conducting an exhaustive feature analysis within eight different communication scenarios. Our analysis reveals previously unknown features in several scenarios that can be used to fingerprint websites with much higher accuracy than previously demonstrated. This work helps the community better understand the extent of learnability (and vulnerability) from TCP/IP headers.
