Feature subsumption for sentiment classification in multiple languages

作者: Zhongwu Zhai , Hua Xu , Jun Li , Peifa Jia

DOI: 10.1007/978-3-642-13672-6_26

关键词:

摘要: An open problem in machine learning-based sentiment classification is how to extract complex features that outperform simple features; figuring out which types of are most valuable another Most the studies focus primarily on character or word Ngrams features, but substring-group have never been considered area before In this study, extracted and selected for by means transductive algorithm To demonstrate generality, experiments conducted three datasets different languages: Chinese, English Spanish The experimental results show proposed algorithm's performance usually superior best related work, feature subsumption multilingual Compared inductive algorithm, also illustrate can significantly improve As term weighting, “tfidf-c” outperforms all other weighting approaches algorithm.

参考文章(36)
Nigel Collier, Tony Mullen, Sentiment Analysis using Support Vector Machines with Diverse Information Sources empirical methods in natural language processing. pp. 412- 418 ,(2004)
Wessel Kraaij, Stephan Raaijmakers, A Shallow Approach to Subjectivity Classification international conference on weblogs and social media. pp. 216- ,(2008)
Minqing Hu, Bing Liu, Mining opinion features in customer reviews national conference on artificial intelligence. pp. 755- 760 ,(2004)
Xiaowen Ding, Bing Liu, Philip S. Yu, A holistic lexicon-based approach to opinion mining web search and data mining. pp. 231- 240 ,(2008) , 10.1145/1341531.1341561
Jun Li, Maosong Sun, Experimental Study on Sentiment Classification of Chinese Review using Machine Learning Techniques international conference natural language processing. pp. 393- 400 ,(2007) , 10.1109/NLPKE.2007.4368061
Songbo Tan, Yuefen Wang, Xueqi Cheng, Combining learn-based and lexicon-based techniques for sentiment detection without using labeled examples Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08. pp. 743- 744 ,(2008) , 10.1145/1390334.1390481
E. Ukkonen, On-line construction of suffix trees Algorithmica. ,vol. 14, pp. 249- 260 ,(1995) , 10.1007/BF01206331