What drives research efforts?: find scientific claims that count!

作者: Jose Maria Gonzalez Pinto , Janus Wawrzinek , Wolf-Tilo Balke

DOI: 10.1109/JCDL.2019.00038

关键词:

摘要: Researchers often struggle to solve a common problem: how does one know whether research hypothesis is worth investigating? Given the increasing number of publications, it complicated guide such decisions. Previous work has shown predicting generally emerging topics can provide some help. Yet, in specialized scientific domains, only little known about service that allows users ease identification claims investigating. Scientific here means natural language sentence expresses relationship between two entities. In particular, them affects, manipulates, or causes other entity. this paper, we propose data-driven approach aiming at filling gap and empowering query level: given results query, deliver characterization clusters discover contextualization those may be more efforts. To do so, cluster documents with share same context by leveraging co-clustering. After that, characterize annotate them. Our annotation focuses on core aspects: controversy diversity cluster. Controversy arises when semantically contradict each other; presence different semantics not but insights expressed paper. evaluate benefits our approach, performed an extensive retrospective analysis PubMed.

参考文章(40)
Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni, Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni, An Introduction to Information Retrieval Springer, Berlin, Heidelberg. pp. 3- 11 ,(2013) , 10.1007/978-3-642-39314-3_1
Xin Ying Qiu, Padmini Srinivasan, Marc Light, The Language of Bioscience: Facts, Speculations, and Statements in Between north american chapter of the association for computational linguistics. pp. 17- 24 ,(2004)
Bongjune Kwon, Hyuk Cho, Scalable co-clustering algorithms international conference on algorithms and architectures for parallel processing. pp. 32- 43 ,(2010) , 10.1007/978-3-642-13119-6_3
David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams, Learning representations by back-propagating errors Nature. ,vol. 323, pp. 696- 699 ,(1988) , 10.1038/323533A0
Hinrich Schütze, Christopher D. Manning, Prabhakar Raghavan, Introduction to Information Retrieval ,(2005)
Tomas Mikolov, Greg S. Corrado, Kai Chen, Jeffrey Dean, Efficient Estimation of Word Representations in Vector Space international conference on learning representations. ,(2013)
Klaus Greff, Rupesh K. Srivastava, Jan Koutnik, Bas R. Steunebrink, Jurgen Schmidhuber, LSTM: A Search Space Odyssey IEEE Transactions on Neural Networks. ,vol. 28, pp. 2222- 2232 ,(2017) , 10.1109/TNNLS.2016.2582924
Yoon Kim, Convolutional Neural Networks for Sentence Classification empirical methods in natural language processing. pp. 1746- 1751 ,(2014) , 10.3115/V1/D14-1181
Ilya Sutskever, Geoffrey E. Hinton, Alex Krizhevsky, Ruslan R. Salakhutdinov, Nitish Srivastava, Improving neural networks by preventing co-adaptation of feature detectors arXiv: Neural and Evolutionary Computing. ,(2012)
Yi Sun, Xiaogang Wang, Xiaoou Tang, Deep Convolutional Network Cascade for Facial Point Detection computer vision and pattern recognition. pp. 3476- 3483 ,(2013) , 10.1109/CVPR.2013.446