Using Crowdsourcing and Active Learning to Track Sentiment in Online Media

作者: Pádraig Cunningham , Derek Greene , Anthony Brew

DOI: 10.3233/978-1-60750-606-5-145

关键词:

摘要: Tracking sentiment in the popular media has long been of interest to analysts and pundits. With availability news content via online syndicated feeds, it is now possible automate some aspects this process. There also great potential crowdsource Crowdsourcing a term, sometimes associated with Web 2.0 technologies, that describes outsourcing tasks large often anonymous community. much annotation work required train machine learning system perform scoring. We describe such for tracking economic deployed since August 2009. It uses annotations provided by cohort non-expert annotators classify body items. report on design challenges addressed managing effort making an interesting experience.

参考文章(16)
A. P. Dawid, A. M. Skene, Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm Journal of The Royal Statistical Society Series C-applied Statistics. ,vol. 28, pp. 20- 28 ,(1979) , 10.2307/2346806
Giuseppe Attardi, Maria Simi, Blog Mining Through Opinionated Words text retrieval conference. ,(2006)
Foster Provost, R Fawcett, T, Kohavi, The Case against Accuracy Estimation for Comparing Induction Algorithms international conference on machine learning. pp. 445- 453 ,(1998)
Shotaro Matsumoto, Hiroya Takamura, Manabu Okumura, Sentiment Classification Using Word Sub-sequences and Dependency Sub-trees Advances in Knowledge Discovery and Data Mining. ,vol. 3518, pp. 301- 311 ,(2005) , 10.1007/11430919_37
Rion Snow, Brendan O'Connor, Daniel Jurafsky, Andrew Y. Ng, Cheap and fast---but is it good? Proceedings of the Conference on Empirical Methods in Natural Language Processing - EMNLP '08. pp. 254- 263 ,(2008) , 10.3115/1613715.1613751
Vikas C. Raykar, Shipeng Yu, Linda H. Zhao, Anna Jerebko, Charles Florin, Gerardo Hermosillo Valadez, Luca Bogoni, Linda Moy, Supervised learning from multiple experts Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 889- 896 ,(2009) , 10.1145/1553374.1553488
Theresa Wilson, Janyce Wiebe, Paul Hoffmann, Recognizing contextual polarity in phrase-level sentiment analysis Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05. pp. 347- 354 ,(2005) , 10.3115/1220575.1220619
Victor S. Sheng, Foster Provost, Panagiotis G. Ipeirotis, Get another label? improving data quality and data mining using multiple, noisy labelers Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 614- 622 ,(2008) , 10.1145/1401890.1401965
Anthony Brew, Derek Greene, Pádraig Cunningham, Is it Over Yet? Learning to Recognize Good News in Financial Media ,(2010)
Pinar Donmez, Jaime G. Carbonell, Jeff Schneider, Efficiently learning the accuracy of labeling sources for selective sampling Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '09. pp. 259- 268 ,(2009) , 10.1145/1557019.1557053