In Search of Credible News

作者： Momchil Hardalov , Ivan Koychev , Preslav Nakov

关键词: Information retrieval 、 Scale (social sciences) 、 World Wide Web 、 Punctuation 、 Feature set 、 Pronoun 、 Fake news 、 Social media 、 Capitalization 、 Credibility 、 Computer science

摘要: We study the problem of finding fake online news. This is an important as news questionable credibility have recently been proliferating in social media at alarming scale. As this understudied problem, especially for languages other than English, we first collect and release to research community three new balanced credible vs. datasets derived from four sources. then propose a language-independent approach automatically distinguishing news, based on rich feature set. In particular, use linguistic (n-gram), credibility-related (capitalization, punctuation, pronoun use, sentiment polarity), semantic (embeddings DBPedia data) features. Our experiments different testsets show that our model can distinguish with very high accuracy.

参考文章(19)

Ann M. Brill, Online Journalists Embrace New Marketing Function Newspaper Research Journal. ,vol. 22, pp. 28- 40 ,(2001) , 10.1177/073953290102200203

Carlos Castillo, Marcelo Mendoza, Barbara Poblete, Predicting information credibility in time-sensitive social media Internet Research. ,vol. 23, pp. 560- 588 ,(2013) , 10.1108/INTR-05-2012-0095

T. Kohonen, Improved versions of learning vector quantization international joint conference on neural network. pp. 545- 550 ,(1990) , 10.1109/IJCNN.1990.137622

Thomas J. Johnson, Barbara K. Kaye, Shannon L. Bichard, W. Joann Wong, Every Blog Has Its Day: Politically-interested Internet Users’ Perceptions of Blog Credibility Journal of Computer-Mediated Communication. ,vol. 13, pp. 100- 122 ,(2007) , 10.1111/J.1083-6101.2007.00388.X

William P. Cassidy, Online News Credibility: An Examination of the Perceptions of Newspaper Journalists Journal of Computer-Mediated Communication. ,vol. 12, pp. 478- 498 ,(2007) , 10.1111/J.1083-6101.2007.00334.X

Dong C. Liu, Jorge Nocedal, On the limited memory BFGS method for large scale optimization Mathematical Programming. ,vol. 45, pp. 503- 528 ,(1989) , 10.1007/BF01589116

Stan Ketterer, Teaching Students How to Evaluate and Use Online Resources Journalism & Mass Communication Educator. ,vol. 52, pp. 4- 14 ,(1997) , 10.1177/107769589705200401

Rada Mihalcea, Carlo Strapparava, Making computers laugh Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05. pp. 531- 538 ,(2005) , 10.3115/1220575.1220642

Arkaitz Zubiaga, Heng Ji, Tweet, but verify: epistemic study of information verification on Twitter Social Network Analysis and Mining. ,vol. 4, pp. 163- ,(2014) , 10.1007/S13278-014-0163-Y

10.

Hui Zou, Trevor Hastie, Regularization and variable selection via the elastic net Journal of The Royal Statistical Society Series B-statistical Methodology. ,vol. 67, pp. 301- 320 ,(2005) , 10.1111/J.1467-9868.2005.00503.X

In Search of Credible News

来源期刊

我的账户

In Search of Credible News

来源期刊

相似文章 10

我的账户