Sentiment and topic analysis on social media

作者: Shu Huang , Wei Peng , Jingxuan Li , Dongwon Lee

DOI: 10.1145/2464464.2464512

关键词: Social mediaPerceptionAmbiguityMulti-label classificationNatural language processingComputer scienceArtificial intelligenceClass (biology)Data miningIsolation (database systems)Task (project management)Sentiment analysis

摘要: Both sentiment analysis and topic classification are frequently used in customer care marketing. They can help people understand the brand perception opinions from social media, such as online posts, tweets, forums, blogs. As such, recent years, many solutions have been proposed for both tasks. However, we believe that following two problems not addressed adequately: (1) Conventional usually treat tasks isolation. When closely related (e.g., posts about "customer care" often a "negative" tone), exploring their correlation may yield better accuracy; (2) Each post is assigned with only one label label. Since media is, compared to traditional document corpus, more noisy, ambiguous, sparser, single be able capture classes accurately. To address these problems, this paper, propose multi-task multi-label (MTML) model performs of sentiments topics concurrently. It incorporates results each task prior steps promote reinforce other iteratively. For task, trained multiple labels so they class ambiguity. In empirical validation, compare accuracy MTML against four competing methods different settings. Results show produces much higher classifications.

参考文章(23)
Fadi A Thabtah, Peter Cowling, Yonghong Peng, None, MMAC: a new multi-class, multi-label associative classification approach international conference on data mining. pp. 217- 224 ,(2004) , 10.1109/ICDM.2004.10117
Boris Lauser, Andreas Hotho, Automatic Multi-label Subject Indexing in a Multilingual Environment international conference theory and practice digital libraries. pp. 140- 151 ,(2003) , 10.1007/978-3-540-45175-4_14
Rebecca Passonneau, Ilia Vovsha, Owen Rambow, Boyi Xie, Apoorv Agarwal, Sentiment Analysis of Twitter Data Proceedings of the Workshop on Language in Social Media (LSM 2011). pp. 30- 38 ,(2011)
Amanda Clare, Ross D. King, Knowledge Discovery in Multi-label Phenotype Data european conference on principles of data mining and knowledge discovery. pp. 42- 53 ,(2001) , 10.1007/3-540-44794-6_4
Tony Jebara, Multi-task feature and kernel selection for SVMs Twenty-first international conference on Machine learning - ICML '04. pp. 55- ,(2004) , 10.1145/1015330.1015426
Kyosuke Nishida, Ryohei Banno, Ko Fujimura, Takahide Hoshide, Tweet classification by data compression Proceedings of the 2011 international workshop on DETecting and Exploiting Cultural diversiTy on the social web - DETECT '11. pp. 29- 34 ,(2011) , 10.1145/2064448.2064473
Farbound Tai, Hsuan-Tien Lin, Multilabel classification with principal label space transformation Neural Computation. ,vol. 24, pp. 2508- 2542 ,(2012) , 10.1162/NECO_A_00320
Shai Ben-David, Reba Schuller, Exploiting Task Relatedness for Multiple Task Learning conference on learning theory. pp. 567- 580 ,(2003) , 10.1007/978-3-540-45167-9_41
Kyosuke Nishida, Takahide Hoshide, Ko Fujimura, Improving tweet stream classification by detecting changes in word probability Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '12. pp. 971- 980 ,(2012) , 10.1145/2348283.2348412
Freddy Chong Tat Chua, William W. Cohen, Justin Betteridge, Ee-Peng Lim, Community-based classification of noun phrases in twitter Proceedings of the 21st ACM international conference on Information and knowledge management - CIKM '12. pp. 1702- 1706 ,(2012) , 10.1145/2396761.2398501