Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources

作者: Fenglong Ma , Chuishi Meng , Houping Xiao , Qi Li , Jing Gao

DOI: 10.1145/3097983.3098129

关键词: Data sciencePharmaceutical industryQuality (business)Health informaticsField (computer science)Drug developmentDrug side effectsAdverse Event Reporting SystemComputer scienceDrug

摘要: Drug side-effects become a worldwide public health concern, which are the fourth leading cause of death in United States. Pharmaceutical industry has paid tremendous effort to identify drug during development. However, it is impossible and impractical all them. Fortunately, can also be reported on heterogeneous platforms (i.e., data sources), such as FDA Adverse Event Reporting System various online communities. existing supervised semi-supervised approaches not practical annotating labels expensive medical field. In this paper, we propose novel effective unsupervised model Sifter automatically discover side-effects. enhances estimation by learning from measuring platform-level user-level quality simultaneously. way, demonstrates better performance compared with terms correctly identifying Experimental results five real-world datasets show that significantly improve state-of-the-art approaches.

参考文章(36)
Ryen W White, Nicholas P Tatonetti, Nigam H Shah, Russ B Altman, Eric Horvitz, Web-scale pharmacovigilance: listening to signals from the crowd Journal of the American Medical Informatics Association. ,vol. 20, pp. 404- 408 ,(2013) , 10.1136/AMIAJNL-2012-001482
Yanqing Ji, Hao Ying, Peter Dews, Ayman Mansour, John Tran, Richard E Miller, R Michael Massanari, A Potential Causal Association Mining Algorithm for Screening Adverse Drug Reactions in Postmarketing Surveillance international conference of the ieee engineering in medicine and biology society. ,vol. 15, pp. 428- 437 ,(2011) , 10.1109/TITB.2011.2131669
Dong Wang, Lance Kaplan, Hieu Le, Tarek Abdelzaher, On truth discovery in social sensing: a maximum likelihood estimation approach information processing in sensor networks. pp. 233- 244 ,(2012) , 10.1145/2185677.2185737
Alban Galland, Serge Abiteboul, Amélie Marian, Pierre Senellart, Corroborating information from disagreeing views web search and data mining. pp. 131- 140 ,(2010) , 10.1145/1718487.1718504
Aron Henriksson, Jing Zhao, Henrik Bostrom, Hercules Dalianis, Modeling electronic health records in ensembles of semantic spaces for adverse drug event detection bioinformatics and biomedicine. pp. 343- 350 ,(2015) , 10.1109/BIBM.2015.7359705
Satya Katragadda, Harika Karnati, Murali Pusala, Vijay Raghavan, Ryan Benton, Detecting adverse drug effects using link classification on twitter data bioinformatics and biomedicine. pp. 675- 679 ,(2015) , 10.1109/BIBM.2015.7359767
Xin Luna Dong, Barna Saha, Divesh Srivastava, Less is more Proceedings of the VLDB Endowment. ,vol. 6, pp. 37- 48 ,(2012) , 10.14778/2535568.2448938
Qi Li, Yaliang Li, Jing Gao, Lu Su, Bo Zhao, Murat Demirbas, Wei Fan, Jiawei Han, A confidence-aware approach for truth discovery on long-tail data Proceedings of the VLDB Endowment. ,vol. 8, pp. 425- 436 ,(2014) , 10.14778/2735496.2735505
Guoliang Li, Jiannan Wang, Yudian Zheng, Michael J. Franklin, Crowdsourced Data Management: A Survey IEEE Transactions on Knowledge and Data Engineering. ,vol. 28, pp. 2296- 2319 ,(2016) , 10.1109/TKDE.2016.2535242