It was easy, when apples and blackberries were only fruits

Authors: Karl Aberer, Zoltán Miklós, Surender Reddy Yerva

Abstract: Ambiguities in company names are omnipresent. This is not accidental: companies deliberately choose ambiguous brand names as part of their marketing and branding strategy. This procedure leads to new challenges when it comes to finding information about a company on the Web. This paper is concerned with the task of classifying Twitter messages according to whether they are related to a given company: for example, given a set of Twitter messages containing the keyword "apple", we classify whether each message is related to Apple Inc. Our technique is essentially an SVM classifier, which uses a simple representation of relevant and irrelevant information in the form of keywords, grouped into specific profiles. We developed a simple technique to construct such classifiers for previously unseen companies, where no training set is available, by training the meta-features of the classifier with the help of a general test set. Our techniques show high accuracy figures over the WePS-3 dataset.
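
The idea of keyword profiles feeding a linear classifier can be illustrated with a minimal sketch. This is not the authors' implementation: the profile contents, the two overlap features, the toy tweets, and the function names (`profile_features`, `train_linear_svm`, `classify`) are all assumptions made for illustration, and the classifier is a small bias-free linear SVM trained by subgradient descent on the hinge loss rather than a library SVM. The meta-feature training for unseen companies mentioned in the abstract is not reproduced here.

```python
import re

# Hypothetical keyword profiles for the company "Apple Inc." (assumed, not from the paper).
POSITIVE_PROFILE = {"iphone", "ipad", "mac", "ios", "store"}
NEGATIVE_PROFILE = {"pie", "juice", "fruit", "tree", "eat"}


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def profile_features(tweet, positive_profile, negative_profile):
    """Return [share of tokens in the positive profile, share in the negative profile]."""
    tokens = tokenize(tweet)
    n = max(len(tokens), 1)
    return [len(tokens & positive_profile) / n, len(tokens & negative_profile) / n]


def train_linear_svm(X, y, epochs=200, lr=0.1, reg=0.01):
    """Train a bias-free linear SVM with subgradient descent on the regularized hinge loss."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * sum(wj * xj for wj, xj in zip(w, xi))
            if margin < 1:
                # Inside the margin: regularization step plus hinge-loss step.
                w = [wj - lr * (reg * wj - yi * xj) for wj, xj in zip(w, xi)]
            else:
                # Outside the margin: regularization step only.
                w = [wj - lr * reg * wj for wj in w]
    return w


# Toy training tweets, invented for illustration (+1 = about the company, -1 = not).
TRAIN = [
    ("new iphone from apple store", 1),
    ("apple ipad and mac update", 1),
    ("apple pie and juice", -1),
    ("eat an apple from the tree", -1),
]

X = [profile_features(t, POSITIVE_PROFILE, NEGATIVE_PROFILE) for t, _ in TRAIN]
y = [label for _, label in TRAIN]
WEIGHTS = train_linear_svm(X, y)


def classify(tweet):
    """Return +1 if the tweet is predicted to be about the company, else -1."""
    x = profile_features(tweet, POSITIVE_PROFILE, NEGATIVE_PROFILE)
    score = sum(wj * xj for wj, xj in zip(WEIGHTS, x))
    return 1 if score > 0 else -1


if __name__ == "__main__":
    for tweet in ["apple ios on my mac", "apple fruit juice"]:
        print(tweet, "->", classify(tweet))
```

In this sketch a tweet with no profile keywords gets a score of zero and falls into the "not related" class; a real system would need richer features and a labeled corpus such as the WePS-3 data mentioned above.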
