Detecting opinion spams through supervised boosting approach.

作者: Mohamad Hazim , Nor Badrul Anuar , Mohd Faizal Ab Razak , Nor Aniza Abdullah , None

DOI: 10.1371/JOURNAL.PONE.0198884

关键词:

摘要: Product reviews are the individual’s opinions, judgement or belief about a certain product service provided by companies. Such serve as guides for these companies to plan and monitor their business ventures in terms of increasing productivity enhancing product/service qualities. can also increase profits convincing future customers products which they have interest in. In mobile application marketplace such Google Playstore, star ratings used indicators quality. However, among all reviews, hereby known spams exist, disrupt online balance. Previous studies time series neural network approach (which require lot computational power) detect opinion spams. detection performance be restricted accuracy because focusses on basic, discrete document level features only thereby, projecting little statistical relationships. Aiming improve marketplace, this study proposes using based that modelled through supervised boosting Extreme Gradient Boost (XGBoost) Generalized Boosted Regression Model (GBM) evaluate two multilingual datasets (i.e. English Malay language). From evaluation done, it was found XGBoost is most suitable detecting dataset while GBM Gaussian dataset. The comparative analysis indicates implementation proposed had achieved rate 87.43 per cent 86.13

参考文章(52)
Christopher Glenn Harris, Detecting Deceptive Opinion Spam Using Human Computation national conference on artificial intelligence. ,(2012)
Vivek Venkataraman, Arjun Mukherjee, Bing Liu, Natalie S. Glance, What Yelp Fake Review Filter Might Be Doing international conference on weblogs and social media. pp. 409- 418 ,(2013)
David Savage, Xiuzhen Zhang, Xinghuo Yu, Pauline Chou, Qingmai Wang, Detection of opinion spam based on anomalous rating deviation Expert Systems With Applications. ,vol. 42, pp. 8650- 8657 ,(2015) , 10.1016/J.ESWA.2015.07.019
Jerome H. Friedman, Greedy function approximation: A gradient boosting machine. Annals of Statistics. ,vol. 29, pp. 1189- 1232 ,(2001) , 10.1214/AOS/1013203451
Junting Ye, Leman Akoglu, Discovering Opinion Spammer Groups by Network Footprints conference on online social networks. pp. 97- 97 ,(2015) , 10.1145/2817946.2820606
Aoying Zhou, Xiaoling Wang, Jingwei Zhang, Hao Wu, Yuming Lin, Tao Zhu, Towards online anti-opinion spam: spotting fake reviews from the review sequence advances in social networks analysis and mining. pp. 261- 264 ,(2014) , 10.5555/3191835.3191887
Qingxi Peng, Ming Zhong, Detecting Spam Review through Sentiment Analysis Journal of Software. ,vol. 9, pp. 2065- 2072 ,(2014) , 10.4304/JSW.9.8.2065-2072
Stephen Tyree, Kilian Q. Weinberger, Kunal Agrawal, Jennifer Paykin, Parallel boosted regression trees for web search ranking Proceedings of the 20th international conference on World wide web - WWW '11. pp. 387- 396 ,(2011) , 10.1145/1963405.1963461
Yoav Freund, Robert E Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting conference on learning theory. ,vol. 55, pp. 119- 139 ,(1997) , 10.1006/JCSS.1997.1504
Jerry Ye, Jyh-Herng Chow, Jiang Chen, Zhaohui Zheng, Stochastic gradient boosted distributed decision trees Proceeding of the 18th ACM conference on Information and knowledge management - CIKM '09. pp. 2061- 2064 ,(2009) , 10.1145/1645953.1646301