Sentiment Extraction from Unstructured Text using Tabu Search-Enhanced Markov Blanket

作者: Edoardo Airoldi , Xue Bai , Rema Padman

DOI:

关键词: Set (abstract data type)Artificial intelligenceConditional dependenceTabu searchGuided Local SearchMarkov blanketMachine learningThe InternetMarketing researchVocabularyEngineering

摘要: Extracting sentiments from unstructured text has emerged as an important problem in many disciplines. An accurate method would enable us, for example, to mine on-line opinions the Internet and learn customers’ preferences economic or marketing research, leveraging a strategic advantage. In this paper, we propose two-stage Bayesian algorithm that is able capture dependencies among words, and, at same time, finds vocabulary efficient purpose of extracting sentiments. Experimental results on Movie Reviews data set show our select parsimonious feature with substantially fewer predictor variables than full leads better predictions about sentiment orientations several state-of-the-art machine learning methods. Our findings suggest are captured by conditional dependence relations rather keywords high-frequency words.

参考文章(22)
Peter Spirtes, Christopher Meek, Learning Bayesian networks with discrete variables from data knowledge discovery and data mining. pp. 294- 299 ,(1995)
Alison Huettner, Fuzzy Typing for Document Management ,(2000)
Xue Bai, Rema Padman, Tabu Search Enhanced Markov Blanket Classifier for High Dimensional Data Sets Operations Research/Computer Science Interfaces Series. pp. 337- 354 ,(2005) , 10.1007/0-387-23529-9_22
Clark N. Glymour, Peter Spirtes, Richard Scheines, Causation, prediction, and search ,(1993)
Foster Provost, R Fawcett, T, Kohavi, The Case against Accuracy Estimation for Comparing Induction Algorithms international conference on machine learning. pp. 445- 453 ,(1998)
Learning equivalence classes of bayesian-network structures Journal of Machine Learning Research. ,vol. 2, pp. 445- 498 ,(2002) , 10.1162/153244302760200696
Thorsten Joachims, A Statistical Learning Model of Text Classification for Support Vector Machines. international acm sigir conference on research and development in information retrieval. pp. 128- 136 ,(2001)
Yoav Freund, Robert E. Schapire, Large margin classification using the perceptron algorithm conference on learning theory. ,vol. 37, pp. 209- 217 ,(1998) , 10.1145/279943.279985
Charles Egerton Osgood, George J. Suci, Percy H. Tannenbaum, The Measurement of Meaning ,(1957)
Hugo Liu, Henry Lieberman, Ted Selker, A model of textual affect sensing using real-world knowledge intelligent user interfaces. pp. 125- 132 ,(2003) , 10.1145/604045.604067