Using hybrid algorithmic-crowdsourcing methods for academic knowledge acquisition

作者: Zhaoan Dong , Jiaheng Lu , Tok Wang Ling , Ju Fan , Yueguo Chen

DOI: 10.1007/S10586-017-1089-8

关键词:

摘要: Scientific literature contains a lot of meaningful objects such as Figures, Tables, Definitions, Algorithms, etc., which are called Knowledge Cells hereafter. An advanced academic search engine could take advantage and their various relationships to obtain more accurate results is expected. Further, it’s expected provide fine-grained regarding for deep-level information discovery exploration. Therefore, it important identify extract the often intrinsic implicit in articles. With exponential growth scientific publications, acquisition useful knowledge impose some practical challenges For example, existing algorithmic methods can hardly extend handle diverse layouts journals, nor scale up process massive documents. As crowdsourcing has become powerful paradigm large problem-solving especially tasks that difficult computers but easy human, we consider problem crowd-sourced database show hybrid framework integrate accuracy workers speed automatic algorithms. In this paper, introduce our current system implementation, platform (PANDA), well interesting observations promising future directions.

参考文章(32)
Panagiotis G. Ipeirotis, Foster Provost, Jing Wang, Quality management on Amazon Mechanical Turk knowledge discovery and data mining. pp. 64- 67 ,(2010) , 10.1145/1837885.1837906
Sarath Kumar Kondreddi, Peter Triantafillou, Gerhard Weikum, Combining information extraction and human computing for crowdsourced knowledge acquisition international conference on data engineering. pp. 988- 999 ,(2014) , 10.1109/ICDE.2014.6816717
Eric Horvitz, Severin Hacker, Ece Kamar, Combining human and machine intelligence in large-scale crowdsourcing adaptive agents and multi-agents systems. pp. 467- 474 ,(2012) , 10.5555/2343576.2343643
Stefan Klampfl, Michael Granitzer, Kris Jack, Roman Kern, Unsupervised document structure analysis of digital scientific articles International Journal on Digital Libraries. ,vol. 14, pp. 83- 99 ,(2014) , 10.1007/S00799-014-0115-1
Mahmoud Alewiwi, Cengiz Orencik, Erkay Savaş, Efficient top-k similarity document search utilizing distributed file systems and cosine similarity Cluster Computing. ,vol. 19, pp. 109- 126 ,(2016) , 10.1007/S10586-015-0506-0
L.I. Kuncheva, C.J. Whitaker, C.A. Shipp, R.P.W. Duin, Limits on the majority vote accuracy in classifier fusion Pattern Analysis and Applications. ,vol. 6, pp. 22- 31 ,(2003) , 10.1007/S10044-002-0173-7
Nuno Luz, Nuno Silva, Paulo Novais, Generating Human-Computer Micro-task Workflows from Domain Ontologies international conference on human computer interaction. ,vol. 8510, pp. 98- 109 ,(2014) , 10.1007/978-3-319-07233-3_10
Jeffrey Rzeszotarski, Aniket Kittur, CrowdScape Proceedings of the 25th annual ACM symposium on User interface software and technology - UIST '12. pp. 55- 62 ,(2012) , 10.1145/2380116.2380125
Manas Joglekar, Hector Garcia-Molina, Aditya Parameswaran, Evaluating the crowd with confidence knowledge discovery and data mining. pp. 686- 694 ,(2013) , 10.1145/2487575.2487595
Michael J. Franklin, Donald Kossmann, Tim Kraska, Sukriti Ramesh, Reynold Xin, CrowdDB: answering queries with crowdsourcing international conference on management of data. pp. 61- 72 ,(2011) , 10.1145/1989323.1989331