Effects of Training Datasets on Both the Extreme Learning Machine and Support Vector Machine for Target Audience Identification on Twitter

作者: Siaw Ling Lo , David Cornforth , Raymond Chiong

DOI: 10.1007/978-3-319-14063-6_35

关键词:

摘要: The ability to identify or predict a target audience from the increasingly crowded social space will provide company some competitive advantage over other companies. In this paper, we analyze various training datasets, which include Twitter contents of an account owner and its list followers, using features generated in different ways for two machine learning approaches - Extreme Learning Machine (ELM) Support Vector (SVM). Various configurations ELM SVM have been evaluated. results indicate that datasets tweets achieve best performance, relative feature sets. This finding is important may aid researchers developing classifier capable identifying specific group members. assist spend resources more effectively, by sending offers right audience, hence maximize marketing efficiency improve return on investment.

参考文章(34)
Alexandre Passant, Denny Vrandecic, John G. Breslin, Social Semantic Web. Handbook of Semantic Web Technologies. pp. 467- 506 ,(2011)
Evgeniy Gabrilovich, Shaul Markovitch, Feature generation for text categorization using world knowledge international joint conference on artificial intelligence. pp. 1048- 1053 ,(2005)
Yongzheng Zhang, Marco Pennacchiotti, Predicting purchase behaviors from social media Proceedings of the 22nd international conference on World Wide Web - WWW '13. pp. 1521- 1532 ,(2013) , 10.1145/2488388.2488521
Marina Sokolova, Nathalie Japkowicz, Stan Szpakowicz, Beyond Accuracy, F-Score and ROC: A Family of Discriminant Measures for Performance Evaluation Lecture Notes in Computer Science. pp. 1015- 1021 ,(2006) , 10.1007/11941439_114
Christopher D. Manning, Hinrich Schütze, Foundations of Statistical Natural Language Processing ,(1999)
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Tao Yang, Dongwon Lee, Su Yan, Steeler nation, 12th man, and boo birds: classifying Twitter user interests using time series advances in social networks analysis and mining. pp. 684- 691 ,(2013) , 10.1145/2492517.2492551
Erik Cambria, Thomas Mazzocco, Amir Hussain, Application of multi-dimensional scaling and artificial neural networks for biologically inspired opinion mining biologically inspired cognitive architectures. ,vol. 4, pp. 41- 53 ,(2013) , 10.1016/J.BICA.2013.02.003
Grzegorz Kondrak, Daniel Marcu, Kevin Knight, Cognates can improve statistical translation models Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology companion volume of the Proceedings of HLT-NAACL 2003--short papers - NAACL '03. pp. 46- 48 ,(2003) , 10.3115/1073483.1073499
Peter Willett, The Porter stemming algorithm: then and now Program: Electronic Library and Information Systems. ,vol. 40, pp. 219- 223 ,(2006) , 10.1108/00330330610681295