Trialstreamer: a living, automatically updated database of clinical trial reports

作者: Iain J Marshall , Benjamin Nye , Joël Kuiper , Anna Noel-Storr , Rachel Marshall

DOI: 10.1101/2020.05.15.20103044

关键词:

摘要: Objective Randomized controlled trials (RCTs) are the gold standard method for evaluating whether a treatment works in healthcare, but can be difficult to find and make use of. We describe development evaluation of system automatically categorize all new RCT reports. Materials Methods Trialstreamer, continuously monitors PubMed WHO International Clinical Trials Registry Platform (ICTRP), looking RCTs humans using validated classifier. combine machine learning rule-based methods extract information from abstracts, including free-text descriptions trial populations, interventions outcomes (the 9PICO9) map these snippets normalised MeSH vocabulary terms. additionally identify sample sizes, predict risk bias, text conveying key findings. store extracted data database which we freely available download, via search portal, allows users enter structured clinical queries. Results ranked prioritize larger higher-quality studies. Results As May 2020, have indexed 669,895 publications RCTs, 18,485 were published first four months 2020 (144/day). include 303,319 registrations ICTRP. The median size was 66. Conclusions We present an automated finding categorising RCTs. This yields novel resource: A humans. daily updates this on our website (trialstreamer.robotreviewer.net).

参考文章(19)
David C. Slawson, Allen F. Shaughnessy, Joshua H. Bennett, Becoming an information master: a guidebook to the medical information jungle. Journal of Family Practice. ,vol. 39, pp. 489- 500 ,(1994)
Zhiheng Huang, Kai Yu, Wei Xu, Bidirectional LSTM-CRF Models for Sequence Tagging arXiv: Computation and Language. ,(2015)
ARIEL S. SCHWARTZ, MARTI A. HEARST, A simple algorithm for identifying abbreviation definitions in biomedical text. pacific symposium on biocomputing. pp. 451- 462 ,(2002) , 10.1142/9789812776303_0042
Iain J. Marshall, Joël Kuiper, Byron C. Wallace, Automating risk of bias assessment for clinical trials Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. pp. 88- 95 ,(2014) , 10.1145/2649387.2649406
Andrew Y. Ng, Feature selection, L1 vs. L2 regularization, and rotational invariance Twenty-first international conference on Machine learning - ICML '04. pp. 78- ,(2004) , 10.1145/1015330.1015435
Julian PT Higgins, Douglas G Altman, Peter C Gøtzsche, Peter Jüni, David Moher, Andrew D Oxman, Jelena Savović, Kenneth F Schulz, Laura Weeks, Jonathan AC Sterne, The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials BMJ. ,vol. 343, ,(2011) , 10.1136/BMJ.D5928
Hilda Bastian, Paul Glasziou, Iain Chalmers, Seventy-Five Trials and Eleven Systematic Reviews a Day: How Will We Ever Keep Up? PLOS Medicine. ,vol. 7, pp. 1- 6 ,(2010) , 10.1371/JOURNAL.PMED.1000326
Ilya Sutskever, Tomas Mikolov, Greg S Corrado, Kai Chen, Jeff Dean, Distributed Representations of Words and Phrases and their Compositionality neural information processing systems. ,vol. 26, pp. 3111- 3119 ,(2013)
Victor M Montori, Nancy L Wilczynski, Douglas Morgan, R Brian Haynes, Optimal search strategies for retrieving systematic reviews from Medline: analytical survey BMJ. ,vol. 330, pp. 68- 71 ,(2005) , 10.1136/BMJ.38336.804167.47