Gender Prediction Based on Data Streams of Smartphone Applications

作者: Yilei Wang , Yuanyang Tang , Jun Ma , Zhen Qin

DOI: 10.1007/978-3-319-22047-5_10

关键词:

摘要: Gender information has great values in personalized service, targeted advertising, recommender systems and other aspects. However, such is kind of private information, that many users are reluctant to share. In this paper, we propose a novel approach predict the users’ gender by analyzing data streams smartphone applications. The proposed assumes certain features extracted from could represent perspective characteristics (e.g., gender). To be more specific, noticed male female have different response time Thus extract key feature – Response-Time application. Moreover, leveraging construct training data, further importing Support Vector Machine (SVM) classifier, verified well predicted. experiments, dataset real world collected 25 volunteers. prediction results can achieve 86.50% Accuracy 86.43% F1-score, respectively. best our knowledge, first was predicted

参考文章(16)
O. Dousse, Olivier Bornet, M. Miettinen, Daniel Gatica-Perez, Trinh-Minh-Tri Do, Blom J., I. Aad, J. Eberle, J. K. Laurila, The Mobile Data Challenge: Big Data for Mobile Computing Research Pervasive Computing. ,(2012)
Ron Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection international joint conference on artificial intelligence. ,vol. 2, pp. 1137- 1143 ,(1995)
Edith G. Smit, Guda Van Noort, Hilde A.M. Voorveld, Understanding online behavioural advertising Computers in Human Behavior. ,vol. 32, pp. 15- 22 ,(2014) , 10.1016/J.CHB.2013.11.008
Ingmar Weber, Alejandro Jaimes, Demographic information flows Proceedings of the 19th ACM international conference on Information and knowledge management - CIKM '10. pp. 1521- 1524 ,(2010) , 10.1145/1871437.1871662
Jian Hu, Hua-Jun Zeng, Hua Li, Cheng Niu, Zheng Chen, Demographic prediction based on user's browsing behavior the web conference. pp. 151- 160 ,(2007) , 10.1145/1242572.1242594
Aniko Hannak, Piotr Sapiezynski, Arash Molavi Kakhki, Balachander Krishnamurthy, David Lazer, Alan Mislove, Christo Wilson, Measuring personalization of web search Proceedings of the 22nd international conference on World Wide Web - WWW '13. pp. 527- 538 ,(2013) , 10.1145/2488388.2488435
Nikesh Garera, David Yarowsky, Modeling Latent Biographic Attributes in Conversational Genres international joint conference on natural language processing. pp. 710- 718 ,(2009) , 10.3115/1690219.1690245
Fadly Hamka, Harry Bouwman, Mark de Reuver, Maarten Kroesen, Mobile customer segmentation based on smartphone measurement Telematics and Informatics. ,vol. 31, pp. 220- 227 ,(2014) , 10.1016/J.TELE.2013.08.006
M Kosinski, T. Stillwell, D., Graepel, Private traits and attributes are predictable from digital records of human behavior Proceedings of the National Academy of Sciences of the United States of America. ,vol. 110, pp. 5802- 5805 ,(2013) , 10.1073/PNAS.1218772110