Citizen Science for Mining the Biomedical Literature

作者: Ginger Tsueng , Steven M. Nanis , Jennifer Fouquier , Benjamin M Good , Andrew I Su

DOI: 10.1101/038083

关键词:

摘要: Biomedical literature represents one of the largest and fastest growing collections unstructured biomedical knowledge. Finding critical information buried in can be challenging. In order to extract from freeflowing text, researchers need to: 1. identify entities text (named entity recognition), 2. apply a standardized vocabulary these (normalization), 3. how are related another (relationship extraction.) Researchers have primarily approached extraction tasks through manual expert curation, computational methods. We previously demonstrated that named recognition (NER) crowdsourced group nonexperts via paid microtask platform, Amazon Mechanical Turk (AMT); dramatically reduce cost increase throughput biocuration efforts. However, given size even platforms is not scalable. With our web-based application Mark2Cure ( http://mark2cure.org ), we demonstrate NER also performed by volunteer citizen scientists with high accuracy. metrics Zooniverse Matrices Citizen Science Success provide results here serve as basis comparison for other science projects. Further, discuss design considerations, issues, analytics successfully moving crowdsourcing workflow platform platform. To knowledge, this study first natural language processing task.

参考文章(25)
Kazuki Saito, Yoshiko Takagi, Ho Chai Ling, Hideki Takahashi, Masaaki Noji, Molecular cloning, characterization and expression of cDNA encoding phosphoserine aminotransferase involved in phosphorylated pathway of serine biosynthesis from spinach. Plant Molecular Biology. ,vol. 33, pp. 359- 366 ,(1997) , 10.1023/A:1005730725764
David Campos, Sergio Matos, Jose Luis, Biomedical Named Entity Recognition: A Survey of Machine-Learning Tools InTech. ,(2012) , 10.5772/51066
Joe Cox, Eun Young Oh, Brooke Simmons, Chris Lintott, Karen Masters, Anita Greenhill, Gary Graham, Kate Holmes, Defining and Measuring Success in Online Citizen Science: A Case Study of Zooniverse Projects computational science and engineering. ,vol. 17, pp. 28- 41 ,(2015) , 10.1109/MCSE.2015.65
Lutz Bornmann, Rüdiger Mutz, Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references association for information science and technology. ,vol. 66, pp. 2215- 2222 ,(2015) , 10.1002/ASI.23329
Zbigniew Czech, Agnieszka Kowalczyk, Lu Shao, Xi-Quan Cheng, Shai Quan, Yong-Ping Bai, Novel acrylic pressure-sensitive adhesive (PSA) containing silver particles Journal of Adhesion Science and Technology. ,vol. 27, pp. 1446- 1454 ,(2013) , 10.1080/01694243.2012.742402
Andrea Wiggins, Kevin Crowston, Surveying the citizen science landscape First Monday. ,vol. 20, ,(2014) , 10.5210/FM.V20I1.5520
BENJAMIN M GOOD, MAX NANIS, CHUNLEI WU, ANDREW I SU, Microtask crowdsourcing for disease mention annotation in PubMed abstracts. pacific symposium on biocomputing. pp. 282- 293 ,(2014) , 10.1142/9789814644730_0028
Raymond J. Mooney, Razvan Bunescu, Mining knowledge from text using information extraction ACM SIGKDD Explorations Newsletter. ,vol. 7, pp. 3- 10 ,(2005) , 10.1145/1089815.1089817
Don R. Swanson, Fish oil, Raynaud's syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine. ,vol. 30, pp. 7- 18 ,(1986) , 10.1353/PBM.1986.0087