Adapting sentiment lexicons to domain-specific social media texts

作者: Shuyuan Deng , Atish P. Sinha , Huimin Zhao

DOI: 10.1016/J.DSS.2016.11.001

关键词:

摘要: Social media has become the largest data source of public opinion. The application sentiment analysis to social texts great potential, but faces challenges because domain heterogeneity. Sentiment orientation words varies by content domain, learning context-specific in domains continues be a major challenge. language poses another challenge since used today differs significantly from that traditional media. To address these challenges, we propose method adapt existing lexicons for domain-specific classification using an unannotated corpus and dictionary. We evaluate our two large developing corpora, containing 743,069 tweets related stock market one million political topics, respectively, five as seeds baselines. results demonstrate usefulness method, showing significant improvement performance. classification.The proposed addresses both domain.We corpora baselines.The evaluation method.

参考文章(73)
Jonas Krauss, Detlef Schoder, Stefan Nann, Predictive Analytics On Public Data - The Case Of Stock Markets european conference on information systems. pp. 102- ,(2013)
Andrea Esuli, Fabrizio Sebastiani, SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining language resources and evaluation. pp. 417- 422 ,(2006)
David P. Redlawsk, Ramnath Balasubramanyan, William W. Cohen, Doug Pierce, What pushes their buttons? Predicting comment polarity from the content of political blog posts Proceedings of the Workshop on Language in Social Media (LSM 2011). pp. 12- 19 ,(2011)
Tuo Li, Bing Jiang, Cheng Cheng, Wei Xu, Web Mining For Financial Market Prediction Based On Online Sentiments pacific asia conference on information systems. pp. 43- ,(2012)
Olutobi Owoputi, Kevin Gimpel, Nathan Schneider, Chris Dyer, Noah A. Smith, Brendan O'Connor, Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters north american chapter of the association for computational linguistics. pp. 380- 390 ,(2013)
Chau, Xu, Business intelligence in blogs: understanding consumer interactions and communities Management Information Systems Quarterly. ,vol. 36, pp. 1189- 1216 ,(2012) , 10.2307/41703504
LINA ZHOU, JUDEE K. BURGOON, DOUGLAS P. TWITCHELL, TIANTIAN QIN, JAY F. NUNAMAKER Jr., A Comparison of Classification Methods for Predicting Deception in Computer-Mediated Communication Journal of Management Information Systems. ,vol. 20, pp. 139- 166 ,(2004) , 10.1080/07421222.2004.11045779
Janyce Wiebe, Ellen Riloff, Creating Subjective and Objective Sentence Classifiers from Unannotated Texts Computational Linguistics and Intelligent Text Processing. ,vol. 3406, pp. 486- 497 ,(2005) , 10.1007/978-3-540-30586-6_53
Chen-Huei Chou, , Atish Sinha, Huimin Zhao, , , A Hybrid Attribute Selection Approach for Text Classification Journal of the Association for Information Systems. ,vol. 11, pp. 1- ,(2010) , 10.17705/1JAIS.00236
Robeson Bowmani, Pranav Anand, Jean E. Fox Tree, Marilyn Walker, Rob Abbott, Joseph King, How can you say such things?!?: Recognizing Disagreement in Informal Political Argument Proceedings of the Workshop on Language in Social Media (LSM 2011). pp. 2- 11 ,(2011)