Graph-based malware distributors detection

Authors: Andrei Venzhega, Polina Zhinalieva, Nikolay Suboch

DOI: 10.1145/2487788.2488136

Abstract: Search engines are currently facing the problem of websites that distribute malware. In this paper we present a novel, efficient algorithm that learns to detect this kind of spam. We use a bipartite graph with two types of nodes, each representing a layer in the graph: web-sites and file hostings (FH). The two layers are connected by edges representing the fact that a file can be downloaded from the hosting via a link on the web-site. The performance of this spam detection method was verified using a set of ground truth labels: manual assessments by antivirus analysts and automatically generated assessments obtained from antivirus companies. We demonstrate that the proposed method is able to detect new malware even before the best known antivirus solutions detect it.
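
The two-layer graph described in the abstract can be pictured with a minimal Python sketch. The abstract does not specify the learning algorithm, so the `propagate` function below is only a toy averaging pass added for illustration, not the authors' method. All host names, function names, and scores are hypothetical.

```python
from collections import defaultdict

# Hypothetical (web-site, file hosting) download links; all names are made up.
links = [
    ("site-a.example", "fh-1.example"),
    ("site-b.example", "fh-1.example"),
    ("site-b.example", "fh-2.example"),
    ("site-c.example", "fh-2.example"),
]


def build_bipartite(links):
    """Build adjacency maps for both layers: site -> hostings and hosting -> sites."""
    site_to_fh = defaultdict(set)
    fh_to_site = defaultdict(set)
    for site, fh in links:
        site_to_fh[site].add(fh)
        fh_to_site[fh].add(site)
    return dict(site_to_fh), dict(fh_to_site)


def propagate(site_to_fh, fh_to_site, seed_scores, rounds=1):
    """Toy score propagation (an assumption, not the paper's algorithm).

    Each hosting takes the mean score of the sites linking to it, then each
    unlabeled site takes the mean score of its hostings. Seed sites stay fixed.
    """
    site_scores = {s: seed_scores.get(s, 0.0) for s in site_to_fh}
    fh_scores = {}
    for _ in range(rounds):
        fh_scores = {
            fh: sum(site_scores[s] for s in sites) / len(sites)
            for fh, sites in fh_to_site.items()
        }
        for site, hostings in site_to_fh.items():
            if site not in seed_scores:
                site_scores[site] = sum(fh_scores[fh] for fh in hostings) / len(hostings)
    return site_scores, fh_scores


site_to_fh, fh_to_site = build_bipartite(links)
# One site is assumed to carry a ground-truth "malicious" label (score 1.0).
site_scores, fh_scores = propagate(site_to_fh, fh_to_site, {"site-a.example": 1.0})
```

Storing both adjacency maps keeps lookups cheap in either direction, which is what any propagation between the web-site layer and the file hosting layer would need.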
