Phonetic-Based Microtext Normalization for Twitter Sentiment Analysis

Authors: Ranjan Satapathy, Claudia Guerreiro, Iti Chaturvedi, Erik Cambria

DOI: 10.1109/ICDMW.2017.59

Abstract: The proliferation of Web 2.0 technologies and the increasing use of computer-mediated communication resulted in a new form of written text, termed microtext. This poses challenges to natural language processing tools, which are usually designed for well-written text. This paper proposes a phonetic-based framework for normalizing microtext to plain English and, hence, improving the classification accuracy of sentiment analysis. Results demonstrated that there is a high (>0.8) similarity index between tweets normalized by our model and by human annotators in 85.31% of cases, and an increase of >4% in terms of polarity detection after normalization.
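
To make the idea of phonetic normalization concrete, here is a minimal sketch in Python. It is not the authors' framework: the abstract does not say which phonetic encoding is used, so classic American Soundex is swapped in as a stand-in phonetic key. The function names `soundex` and `normalize` and the toy vocabulary are hypothetical and chosen only for illustration. Out-of-vocabulary tokens are replaced by the first vocabulary word that shares their phonetic key.

```python
# Illustrative sketch only: Soundex stands in for whatever phonetic
# encoding the paper actually uses (the abstract does not specify it).

# Consonant groups of American Soundex mapped to their digit codes.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word):
    """Return the 4-character Soundex key of a word ("" if no letters)."""
    word = "".join(ch for ch in word.lower() if ch.isalpha())
    if not word:
        return ""
    digits = []
    prev = CODES.get(word[0], "")
    for ch in word[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            digits.append(code)
        prev = code  # vowels reset prev, so repeated codes across vowels count
    return (word[0].upper() + "".join(digits) + "000")[:4]


def normalize(text, vocabulary):
    """Replace out-of-vocabulary tokens by a vocabulary word with the same key."""
    known = set(vocabulary)
    by_key = {}
    for w in vocabulary:
        by_key.setdefault(soundex(w), w)  # keep the first word per key
    out = []
    for token in text.split():
        if token in known:
            out.append(token)
        else:
            out.append(by_key.get(soundex(token), token))
    return " ".join(out)


if __name__ == "__main__":
    vocab = ["please", "call", "me", "tomorrow"]  # toy vocabulary (assumption)
    print(normalize("pls call me tmrw", vocab))
```

A key-based lookup like this is deliberately crude: Soundex ignores digits, so forms such as "gr8" or "2moro" would need a separate rewriting step, and distinct words can collide on the same key. It only illustrates why matching on sound rather than spelling can recover plain-English forms from microtext.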
