EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models.

Authors: Hyrum S. Anderson, Phil Roth


Keywords: Computer science, Malware, Machine learning, Deep learning, Gradient boosting, Information security, Ember, Artificial intelligence

Abstract: … Legal restrictions. Malicious binaries are shared generously … are often protected by copyright laws that prevent sharing. Both … the first large public dataset for machine learning malware …

References (21)
Murray Brand, Craig Valli, Andrew Woodward. Malware Forensics: Discovery of the Intent of Deception. The Journal of Digital Forensics, Security and Law, vol. 5, pp. 31-42 (2010). doi:10.15394/JDFSL.2010.1082
Razvan Pascanu, Jack W. Stokes, Hermineh Sanossian, Mady Marinescu, Anil Thomas. Malware classification with recurrent networks. International Conference on Acoustics, Speech, and Signal Processing, pp. 1916-1920 (2015). doi:10.1109/ICASSP.2015.7178304
M. Zubair Shafiq, S. Momina Tabish, Fauzan Mirza, Muddassar Farooq. PE-Miner: Mining Structural Information to Detect Malicious Executables in Realtime. Recent Advances in Intrusion Detection, pp. 121-141 (2009). doi:10.1007/978-3-642-04342-0_7
Fred Cohen. Computer viruses. Computers & Security, vol. 6, pp. 22-35 (1987). doi:10.1016/0167-4048(87)90122-2
William W. Cohen. Fast Effective Rule Induction. Machine Learning Proceedings 1995, pp. 115-123 (1995). doi:10.1016/B978-1-55860-377-6.50023-2
Joshua Saxe, Konstantin Berlin. Deep neural network based malware detection using two dimensional binary program features. International Conference on Malicious and Unwanted Software, pp. 11-20 (2015). doi:10.1109/MALWARE.2015.7413680
George E. Dahl, Jack W. Stokes, Li Deng, Dong Yu. Large-scale malware classification using random projections and neural networks. International Conference on Acoustics, Speech, and Signal Processing, pp. 3422-3426 (2013). doi:10.1109/ICASSP.2013.6638293
Sebastian Houben, Johannes Stallkamp, Jan Salmen, Marc Schlipsing, Christian Igel. Detection of traffic signs in real-world images: The German traffic sign detection benchmark. International Joint Conference on Neural Networks, pp. 1-8 (2013). doi:10.1109/IJCNN.2013.6706807
Victor Zue, Stephanie Seneff, James Glass. Speech database development at MIT: Timit and beyond. Speech Communication, vol. 9, pp. 351-356 (1990). doi:10.1016/0167-6393(90)90010-7
Kilian Weinberger, Anirban Dasgupta, John Langford, Alex Smola, Josh Attenberg. Feature hashing for large scale multitask learning. Proceedings of the 26th Annual International Conference on Machine Learning (ICML '09), pp. 1113-1120 (2009). doi:10.1145/1553374.1553516