Learning sentiment classification model from labeled features

作者: Yulan He

DOI: 10.1145/1871437.1871704

关键词:

摘要: We propose a novel framework where an initial classifier is learned by incorporating prior information extracted from existing sentiment lexicon. Preferences on expectations of labels those lexicon words are expressed using generalized expectation criteria. Documents classified with high confidence then used as pseudo-labeled examples for automatical domain-specific feature acquisition. The word-class distributions such self-learned features estimated the and to train another constraining model's predictions unlabeled instances. Experiments both movie review data multi-domain dataset show that our approach attains comparable or better performance than exiting weakly-supervised classification methods despite no labeled documents.

参考文章(11)
Richard D. Lawrence, Prem Melville, Wojciech Gryc, Sentiment analysis of blogs by combining lexical knowledge with text classification Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '09. pp. 1275- 1284 ,(2009) , 10.1145/1557019.1557156
Songbo Tan, Yuefen Wang, Xueqi Cheng, Combining learn-based and lexicon-based techniques for sentiment detection without using labeled examples Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08. pp. 743- 744 ,(2008) , 10.1145/1390334.1390481
Likun Qiu, Weishi Zhang, Changjian Hu, Kai Zhao, SELC Proceeding of the 18th ACM conference on Information and knowledge management - CIKM '09. pp. 929- 936 ,(2009) , 10.1145/1645953.1646072
Taras Zagibalov, John Carroll, Automatic Seed Word Selection for Unsupervised Sentiment Classification of Chinese Text international conference on computational linguistics. pp. 1073- 1080 ,(2008) , 10.3115/1599081.1599216
Ganesh Ramakrishnan, Apurva Jadhav, Ashutosh Joshi, Soumen Chakrabarti, Pushpak Bhattacharyya, Question Answering via Bayesian Inference on Lexical Relations meeting of the association for computational linguistics. pp. 1- 10 ,(2003) , 10.3115/1119312.1119313
Sajib Dasgupta, Vincent Ng, Topic-wise, Sentiment-wise, or Otherwise? Identifying the Hidden Dimension for Unsupervised Text Classification empirical methods in natural language processing. pp. 580- 589 ,(2009) , 10.3115/1699571.1699589
Yulan He, Chenghua Lin, Richard Everson, A Comparative Study of Bayesian Models for Unsupervised Sentiment Detection conference on computational natural language learning. pp. 144- 152 ,(2010)
Gregory Druck, Gideon Mann, Andrew McCallum, Learning from labeled features using generalized expectation criteria Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08. pp. 595- 602 ,(2008) , 10.1145/1390334.1390436
Tao Li, Yi Zhang, Vikas Sindhwani, A Non-negative Matrix Tri-factorization Approach to Sentiment Classification with Lexical Prior Knowledge international joint conference on natural language processing. pp. 244- 252 ,(2009) , 10.3115/1687878.1687914
Alina Andreevskaia, Sabine Bergler, When Specialists and Generalists Work Together: Overcoming Domain Dependence in Sentiment Tagging meeting of the association for computational linguistics. pp. 290- 298 ,(2008)