Truth Discovery by Claim and Source Embedding

作者: Shanshan Lyu , Wentao Ouyang , Yongqing Wang , Huawei Shen , Xueqi Cheng

DOI: 10.1109/TKDE.2019.2936189

关键词:

摘要: Information gathered from multiple sources on the Web often exhibits conflicts. This phenomenon motivates need of truth discovery , which aims to automatically find true claim among conflicting claims. Existing methods are mainly based iterative updates, optimization or probabilistic models. Although these have shown their own effectiveness, they a common limitation. These do not model relationships between each pair source and target such that well capture underlying interactions in data. In this paper, we propose new for discovery, learning representations claims targets. Our first constructs heterogenous network including source-claim, source-source truth-claim relationships. It then embeds into low dimensional space trustworthy close. way, can be conveniently performed embedding space. Moreover, our implemented both semi-supervised un-supervised manners deal with label scarcity problem practical discovery. Experiments three real-world datasets demonstrate outperforms existing state-of-the-art

参考文章(48)
Xian Li, Xin Luna Dong, Kenneth Lyons, Weiyi Meng, Divesh Srivastava, Truth finding on the deep web Proceedings of the VLDB Endowment. ,vol. 6, pp. 97- 108 ,(2012) , 10.14778/2535568.2448943
Xin Luna Dong, Evgeniy Gabrilovich, Geremy Heitz, Wilko Horn, Kevin Murphy, Shaohua Sun, Wei Zhang, From data fusion to knowledge fusion Proceedings of the VLDB Endowment. ,vol. 7, pp. 881- 892 ,(2014) , 10.14778/2732951.2732962
Dan Roth, Jeff Pasternack, Knowing What to Believe (when you already know something) international conference on computational linguistics. pp. 877- 885 ,(2010)
Jeff Pasternack, Dan Roth, Latent credibility analysis Proceedings of the 22nd international conference on World Wide Web - WWW '13. pp. 1009- 1020 ,(2013) , 10.1145/2488388.2488476
Qiaozhu Mei, Meng Qu, Mingzhe Wang, Jian Tang, Ming Zhang, Jun Yan, LINE: Large-scale Information Network Embedding the web conference. pp. 1067- 1077 ,(2015) , 10.1145/2736277.2741093
Rion Snow, Brendan O'Connor, Daniel Jurafsky, Andrew Y. Ng, Cheap and fast---but is it good? Proceedings of the Conference on Empirical Methods in Natural Language Processing - EMNLP '08. pp. 254- 263 ,(2008) , 10.3115/1613715.1613751
Subhabrata Mukherjee, Gerhard Weikum, Cristian Danescu-Niculescu-Mizil, People on drugs: credibility of user statements in health communities knowledge discovery and data mining. pp. 65- 74 ,(2014) , 10.1145/2623330.2623714
Vikas C. Raykar, Shipeng Yu, Linda H. Zhao, Anna Jerebko, Charles Florin, Gerardo Hermosillo Valadez, Luca Bogoni, Linda Moy, Supervised learning from multiple experts Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 889- 896 ,(2009) , 10.1145/1553374.1553488
Yaliang Li, Qi Li, Jing Gao, Lu Su, Bo Zhao, Wei Fan, Jiawei Han, On the Discovery of Evolving Truth knowledge discovery and data mining. ,vol. 2015, pp. 675- 684 ,(2015) , 10.1145/2783258.2783277
Yaliang Li, Qi Li, Minghui Qiu, Jing Gao, Shi Zhi, Lu Su, Bo Zhao, Heng Ji, Jiawei Han, Fenglong Ma, FaitCrowd: Fine Grained Truth Discovery for Crowdsourced Data Aggregation knowledge discovery and data mining. pp. 745- 754 ,(2015) , 10.1145/2783258.2783314