Measuring praise and criticism: Inference of semantic orientation from association

Authors: Peter D. Turney, Michael L. Littman

DOI: 10.1145/944012.944013

Keywords:

Abstract: The evaluative character of a word is called its semantic orientation. Positive semantic orientation indicates praise (e.g., "honest", "intrepid") and negative semantic orientation indicates criticism (e.g., "disturbing", "superfluous"). Semantic orientation varies in both direction (positive or negative) and degree (mild to strong). An automated system for measuring semantic orientation would have application in text classification, text filtering, tracking opinions in online discussions, analysis of survey responses, and automated chat systems (chatbots). This article introduces a method for inferring the semantic orientation of a word from its statistical association with a set of positive and negative paradigm words. Two instances of this approach are evaluated, based on two different statistical measures of word association: pointwise mutual information (PMI) and latent semantic analysis (LSA). The method is experimentally tested with 3,596 words (including adjectives, adverbs, nouns, and verbs) that have been manually labeled positive (1,614 words) and negative (1,982 words). The method attains an accuracy of 82.8% on the full test set, but the accuracy rises above 95% when the algorithm is allowed to abstain from classifying mild words.
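The approach summarized in the abstract can be illustrated with a minimal sketch: a word's orientation is taken as its total PMI association with positive paradigm words minus its total association with negative ones, and the classifier abstains when the magnitude is small. Everything concrete here is an assumption made for illustration and is not taken from the record above: the four paradigm words, the document-level co-occurrence counting, the smoothing constant `eps`, the `threshold` parameter, and the toy corpus.

```python
import math
from collections import Counter
from itertools import combinations

# Placeholder paradigm words and corpus, chosen only for this toy example.
POSITIVE = ["good", "excellent"]
NEGATIVE = ["bad", "poor"]
TOY_CORPUS = [
    "honest good excellent service",
    "honest and good people",
    "disturbing bad poor report",
    "disturbing and poor results",
]

def cooccurrence_counts(documents):
    """Count the documents containing each word and each unordered word pair."""
    word_docs, pair_docs = Counter(), Counter()
    for doc in documents:
        vocab = set(doc.lower().split())
        word_docs.update(vocab)
        pair_docs.update(frozenset(p) for p in combinations(sorted(vocab), 2))
    return word_docs, pair_docs

def pmi(w1, w2, word_docs, pair_docs, n_docs, eps=0.01):
    """log2 of p(w1, w2) / (p(w1) * p(w2)); eps is assumed smoothing to avoid log(0)."""
    joint = (pair_docs[frozenset((w1, w2))] + eps) / n_docs
    p1 = (word_docs[w1] + eps) / n_docs
    p2 = (word_docs[w2] + eps) / n_docs
    return math.log2(joint / (p1 * p2))

def semantic_orientation(word, documents):
    """Association with positive paradigm words minus association with negative ones."""
    word_docs, pair_docs = cooccurrence_counts(documents)
    n = len(documents)
    pos = sum(pmi(word, p, word_docs, pair_docs, n) for p in POSITIVE)
    neg = sum(pmi(word, q, word_docs, pair_docs, n) for q in NEGATIVE)
    return pos - neg

def classify(word, documents, threshold=0.0):
    """Return 'positive' or 'negative', or 'abstain' when the score is too mild."""
    so = semantic_orientation(word, documents)
    if abs(so) <= threshold:
        return "abstain"
    return "positive" if so > 0 else "negative"

if __name__ == "__main__":
    for w in ["honest", "disturbing", "and"]:
        print(w, classify(w, TOY_CORPUS, threshold=1.0))
```

Raising `threshold` trades coverage for accuracy: fewer words are classified, but those that remain have stronger evidence. This mirrors the abstention behaviour the abstract describes.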

References (32)
Thomas K. Landauer, On the computational basis of learning and cognition: Arguments from LSA, Psychology of Learning and Motivation, vol. 41, pp. 43-84 (2002), 10.1016/S0079-7421(02)80004-4
Janyce Wiebe, Learning Subjective Adjectives from Corpora, National Conference on Artificial Intelligence, pp. 735-740 (2000)
Peter D. Turney, Mining the web for synonyms: PMI-IR versus LSA on TOEFL, European Conference on Machine Learning, pp. 491-502 (2001), 10.1007/3-540-44795-4_42
Alexander Budanitsky, Graeme Hirst, Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures (2004)
Christopher D. Manning, Hinrich Schütze, Foundations of Statistical Natural Language Processing (1999)
Patrick Hanks, Kenneth Ward Church, Word association norms, mutual information, and lexicography, Computational Linguistics, vol. 16, pp. 22-29 (1990), 10.5555/89086.89095
J. Kamps, M.J. Marx, Words with attitude, TESOL Quarterly, vol. 16, pp. 82-88 (2002)
Frank Smadja, Retrieving collocations from text: Xtract, Computational Linguistics, vol. 19, pp. 143-177 (1993)
Brian T. Bartell, Garrison W. Cottrell, Richard K. Belew, Latent semantic indexing is an optimal special case of multidimensional scaling, International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 161-167 (1992), 10.1145/133160.133191