Generating multi-type sequences of temporal events to improve fraud detection in game advertising.

作者: Lun Jiang , Nima Salehi Sadghiani , Zhuo Tao , None

DOI:

关键词:

摘要: Fraudulent activities related to online advertising can potentially harm the trust advertisers put in networks and sour gaming experience for users. Pay-Per-Click/Install (PPC/I) is one of main revenue models game monetization. Widespread use PPC/I model has led a rise click/install fraud events games. The majority traffic ad non-fraudulent, which imposes difficulties on machine learning based detection systems deal with highly skewed labels. From network standpoint, user are multi-type sequences temporal consisting event types corresponding time intervals. Time Long Short-Term Memory (Time-LSTM) cells have been proved effective modeling intrinsic hidden patterns non-uniform In this study, we propose using variant Time-LSTM combination modified version Sequence Generative Adversarial (SeqGAN)to generate artificial mimic fraudulent traffic. We also Critic instead Monte-Carlo (MC) roll-out training SeqGAN reduce computational costs. GAN-generated be used enhance classification ability event-based classifiers. Our extensive experiments synthetic data shown trained generator capability desired properties measured by multiple criteria.

参考文章(34)
Alex Graves, Generating Sequences With Recurrent Neural Networks arXiv: Neural and Evolutionary Computing. ,(2013)
, Generative Adversarial Nets neural information processing systems. ,vol. 27, pp. 2672- 2680 ,(2014) , 10.3156/JSOFT.29.5_177_2
A.G. Barto, R.S. Sutton, Reinforcement Learning: An Introduction ,(1988)
Richard Oentaryo, Ee-Peng Lim, Michael Finegold, David Lo, Feida Zhu, Clifton Phua, Eng-Yeow Cheu, Ghim-Eng Yap, Kelvin Sim, Minh Nhut Nguyen, Kasun Perera, Bijay Neupane, Mustafa Faisal, Zeyar Aung, Wei Lee Woon, Wei Chen, Dhaval Patel, Daniel Berrar, None, Detecting click fraud in online advertising: a data mining approach Journal of Machine Learning Research. ,vol. 15, pp. 99- 140 ,(2014)
Soumith Chintala, Alec Radford, Luke Metz, Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks arXiv: Learning. ,(2015)
Geoffrey Hinton, Laurens van der Maaten, Visualizing Data using t-SNE Journal of Machine Learning Research. ,vol. 9, pp. 2579- 2605 ,(2008)
Elie Bursztein, Panayiotis Mavrommatis, Nav Jagpal, Chris Sharp, Damon McCoy, Lucas Ballard, Moheeb Abu Rajab, Robert Shield, Niels Provos, Kurt Thomas, Juan A. Elices Crespo, Ryan Rasti, Cait Phillips, Ali Tofigh, Marc Antoine Courteau, Fabio Tirelo, Jean Michel Picod, Marc André Decoste, Investigating Commercial Pay-Per-Install and the Distribution of Unwanted Software usenix security symposium. pp. 721- 739 ,(2016)
Kawaljeet Kaur Kapoor, Yogesh K Dwivedi, Niall C Piercy, None, Pay-per-click advertising: A literature review The Marketing Review. ,vol. 16, pp. 183- 202 ,(2016) , 10.1362/146934716X14636478977557
Michael Pfeiffer, Daniel Neil, Shih-Chii Liu, Phased LSTM: Accelerating Recurrent Network Training for Long or Event-based Sequences neural information processing systems. ,vol. 29, pp. 3882- 3890 ,(2016) , 10.5167/UZH-149394
R. Fortet, E. Mourier, Convergence de la répartition empirique vers la répartition théorique Annales Scientifiques De L Ecole Normale Superieure. ,vol. 70, pp. 267- 285 ,(1953) , 10.24033/ASENS.1013