Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new dataset, the STS-Gold.

作者: Harith Alani , Yulan He , Miriam Fernández , Hassan Saif

DOI:

关键词:

摘要: Sentiment analysis over Twitter offers organisations and individuals a fast effective way to monitor the publics' feelings towards them their competitors. To assess performance of sentiment methods small set evaluation datasets have been released in last few years. In this paper we present an overview eight publicly available manually annotated for analysis. Based on review, show that common limitation most these datasets, when assessing at target (entity) level, is lack distinctive annotations among tweets entities contained them. For example, tweet "I love iPhone, but I hate iPad" can be with mixed label, entity iPhone within should positive label. Aiming overcome limitation, complement current STS-Gold, new dataset where targets (entities) are individually therefore may different labels. This also provides comparative study various along several dimensions including: total number tweets, vocabulary size sparsity. We investigate pair-wise correlation as well correlations classification datasets.

参考文章(24)
Xia Hu, Jiliang Tang, Huiji Gao, Huan Liu, Unsupervised sentiment analysis with emotional signals Proceedings of the 22nd international conference on World Wide Web - WWW '13. pp. 607- 618 ,(2013) , 10.1145/2488388.2488442
Mohamed S. Kamel, Masoud Makrehchi, Automatic extraction of domain-specific stopwords from labeled documents european conference on information retrieval. pp. 222- 233 ,(2008) , 10.5555/1793274.1793304
Tawunrat Chalothorn, Jeremy Ellman, TJP: Using Twitter to Analyze the Polarity of Contexts joint conference on lexical and computational semantics. pp. 375- 379 ,(2013)
Hassan Saif, Yulan He, Harith Alani, Semantic sentiment analysis of twitter international semantic web conference. pp. 508- 524 ,(2012) , 10.1007/978-3-642-35176-1_32
Harith Alani, Yulan He, Hassan Saif, Semantic smoothing for Twitter sentiment analysis ,(2011)
William Deitrick, Wei Hu, Mutually Enhancing Community Detection and Sentiment Analysis on Twitter Networks Journal of Data Analysis and Information Processing. ,vol. 01, pp. 19- 29 ,(2013) , 10.4236/JDAIP.2013.13004
Mike Thelwall, Kevan Buckley, Georgios Paltoglou, Sentiment strength detection for the social web Journal of the Association for Information Science and Technology. ,vol. 63, pp. 163- 173 ,(2012) , 10.1002/ASI.21662
David A. Shamma, Nicholas A. Diakopoulos, Characterizing debate performance via aggregated twitter sentiment human factors in computing systems. pp. 1195- 1198 ,(2010) , 10.1145/1753326.1753504
Mike Thelwall, Arvid Kappas, Georgios Paltoglou, Kevan Buckley, Di Cai, Sentiment in short strength detection informal text Journal of the Association for Information Science and Technology. ,vol. 61, pp. 2544- 2558 ,(2010) , 10.1002/ASI.V61:12
David A. Shamma, Lyndon Kennedy, Elizabeth F. Churchill, Tweet the debates: understanding community annotation of uncollected sources Proceedings of the first SIGMM workshop on Social media. pp. 3- 10 ,(2009) , 10.1145/1631144.1631148