Feature subsumption for sentiment classification in multiple languages

作者： Zhongwu Zhai , Hua Xu , Jun Li , Peifa Jia

关键词:

摘要: An open problem in machine learning-based sentiment classification is how to extract complex features that outperform simple features; figuring out which types of are most valuable another Most the studies focus primarily on character or word Ngrams features, but substring-group have never been considered area before In this study, extracted and selected for by means transductive algorithm To demonstrate generality, experiments conducted three datasets different languages: Chinese, English Spanish The experimental results show proposed algorithm's performance usually superior best related work, feature subsumption multilingual Compared inductive algorithm, also illustrate can significantly improve As term weighting, “tfidf-c” outperforms all other weighting approaches algorithm.

uni-trier.de 本地加速

springer.com 本地加速

sci-hub.st HTML 下载加速

参考文章(36)

Nigel Collier, Tony Mullen, Sentiment Analysis using Support Vector Machines with Diverse Information Sources empirical methods in natural language processing. pp. 412- 418 ,(2004)

Dan Gusfield, Algorithms on Strings, Trees, and Sequences: Suffix Trees and Their Uses ,(1997) , 10.1017/CBO9780511574931

Bing Liu, Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data ,(2010)

Wessel Kraaij, Stephan Raaijmakers, A Shallow Approach to Subjectivity Classification international conference on weblogs and social media. pp. 216- ,(2008)

Minqing Hu, Bing Liu, Mining opinion features in customer reviews national conference on artificial intelligence. pp. 755- 760 ,(2004)

Xiaowen Ding, Bing Liu, Philip S. Yu, A holistic lexicon-based approach to opinion mining web search and data mining. pp. 231- 240 ,(2008) , 10.1145/1341531.1341561

Jun Li, Maosong Sun, Experimental Study on Sentiment Classification of Chinese Review using Machine Learning Techniques international conference natural language processing. pp. 393- 400 ,(2007) , 10.1109/NLPKE.2007.4368061

Dan Gusfield, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology ,(1997)

Songbo Tan, Yuefen Wang, Xueqi Cheng, Combining learn-based and lexicon-based techniques for sentiment detection without using labeled examples Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08. pp. 743- 744 ,(2008) , 10.1145/1390334.1390481

10.

E. Ukkonen, On-line construction of suffix trees Algorithmica. ,vol. 14, pp. 249- 260 ,(1995) , 10.1007/BF01206331

Feature subsumption for sentiment classification in multiple languages

来源期刊

我的账户

Feature subsumption for sentiment classification in multiple languages

来源期刊

相似文章 6

Subjectivity and Sentiment Analysis of Arabic: A Survey

Baseline evaluation: an empirical study of the performance of machine learning algorithms in short snippet sentiment analysis

An empirical study of unsupervised sentiment classification of Chinese reviews

Evaluating Feature Sets and Classifiers for Sentiment Analysis of Financial News

HASKER: An efficient algorithm for string kernels. Application to polarity classification in various languages

Sentiment/subjectivity analysis survey for languages other than English

我的账户