In Search of Credible News

作者: Momchil Hardalov , Ivan Koychev , Preslav Nakov

DOI: 10.1007/978-3-319-44748-3_17

关键词: Information retrievalScale (social sciences)World Wide WebPunctuationFeature setPronounFake newsSocial mediaCapitalizationCredibilityComputer science

摘要: We study the problem of finding fake online news. This is an important as news questionable credibility have recently been proliferating in social media at alarming scale. As this understudied problem, especially for languages other than English, we first collect and release to research community three new balanced credible vs. datasets derived from four sources. then propose a language-independent approach automatically distinguishing news, based on rich feature set. In particular, use linguistic (n-gram), credibility-related (capitalization, punctuation, pronoun use, sentiment polarity), semantic (embeddings DBPedia data) features. Our experiments different testsets show that our model can distinguish with very high accuracy.

参考文章(19)
Ann M. Brill, Online Journalists Embrace New Marketing Function Newspaper Research Journal. ,vol. 22, pp. 28- 40 ,(2001) , 10.1177/073953290102200203
Carlos Castillo, Marcelo Mendoza, Barbara Poblete, Predicting information credibility in time-sensitive social media Internet Research. ,vol. 23, pp. 560- 588 ,(2013) , 10.1108/INTR-05-2012-0095
T. Kohonen, Improved versions of learning vector quantization international joint conference on neural network. pp. 545- 550 ,(1990) , 10.1109/IJCNN.1990.137622
Thomas J. Johnson, Barbara K. Kaye, Shannon L. Bichard, W. Joann Wong, Every Blog Has Its Day: Politically-interested Internet Users’ Perceptions of Blog Credibility Journal of Computer-Mediated Communication. ,vol. 13, pp. 100- 122 ,(2007) , 10.1111/J.1083-6101.2007.00388.X
William P. Cassidy, Online News Credibility: An Examination of the Perceptions of Newspaper Journalists Journal of Computer-Mediated Communication. ,vol. 12, pp. 478- 498 ,(2007) , 10.1111/J.1083-6101.2007.00334.X
Dong C. Liu, Jorge Nocedal, On the limited memory BFGS method for large scale optimization Mathematical Programming. ,vol. 45, pp. 503- 528 ,(1989) , 10.1007/BF01589116
Stan Ketterer, Teaching Students How to Evaluate and Use Online Resources Journalism & Mass Communication Educator. ,vol. 52, pp. 4- 14 ,(1997) , 10.1177/107769589705200401
Rada Mihalcea, Carlo Strapparava, Making computers laugh Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05. pp. 531- 538 ,(2005) , 10.3115/1220575.1220642
Arkaitz Zubiaga, Heng Ji, Tweet, but verify: epistemic study of information verification on Twitter Social Network Analysis and Mining. ,vol. 4, pp. 163- ,(2014) , 10.1007/S13278-014-0163-Y
Hui Zou, Trevor Hastie, Regularization and variable selection via the elastic net Journal of The Royal Statistical Society Series B-statistical Methodology. ,vol. 67, pp. 301- 320 ,(2005) , 10.1111/J.1467-9868.2005.00503.X