Reconciling malware labeling discrepancy via consensus learning

作者: Ting Wang , Xin Hu , Shicong Meng , Reiner Sailer

DOI: 10.1109/ICDEW.2014.6818308

关键词:

摘要: Anti-virus systems developed by different vendors often demonstrate strong discrepancy in the labels they assign to given malware, which significantly hinders threat intelligence sharing. The key challenge of addressing this stems from difficulty re-standardizing already-in-use systems. In paper we explore a non-intrusive alternative. We propose leverage correlation between malware anti-virus create “consensus” classification system, through can share information without modifying their own labeling conventions. To end, present novel integration framework Latin exploits correspondence participating as reflected heterogeneous at instance-instance, instance-class, and class-class levels. provide results extensive experimental studies using real datasets concrete use cases verify efficacy reconciling discrepancy.

参考文章(6)
Federico Maggi, Andrea Bellini, Guido Salvaneschi, Stefano Zanero, Finding non-trivial malware naming inconsistencies international conference on information systems security. pp. 144- 159 ,(2011) , 10.1007/978-3-642-25560-1_10
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Tom Kelchner, The (in)consistent naming of malcode Computer Fraud & Security. ,vol. 2010, pp. 5- 7 ,(2010) , 10.1016/S1361-3723(10)70007-5
William W. Cohen, Frank Lin, Power Iteration Clustering international conference on machine learning. pp. 655- 662 ,(2010)
Michael Bailey, Jon Oberheide, Jon Andersen, Z Morley Mao, Farnam Jahanian, Jose Nazario, None, Automated classification and analysis of internet malware recent advances in intrusion detection. pp. 178- 197 ,(2007) , 10.1007/978-3-540-74320-0_10
Pierre-Marc Bureau, David Harley, A DOSE BY ANY OTHER NAME ,(2008)