Deep context of citations using machine-learning models in scholarly full-text articles

作者: Saeed-Ul Hassan , Mubashir Imran , Sehrish Iqbal , Naif Radi Aljohani , Raheel Nawaz

DOI: 10.1007/S11192-018-2944-Y

关键词:

摘要: Information retrieval systems for scholarly literature rely heavily not only on text matching but semantic- and context-based features. Readers nowadays are deeply interested in how important an article is, its purpose influential it is follow-up research work. Numerous techniques to tap the power of machine learning artificial intelligence have been developed enhance most scientific literature. In this paper, we compare improve four existing state-of-the-art designed identify citations. We consider 450 citations from Association Computational Linguistics corpus, classified by experts as either or unimportant, further extract 64 features based methodology techniques. apply Extra-Trees classifier select 29 best Random Forest Support Vector Machine classifiers all selected Using classifier, our supervised model improves method 11.25%, with 89% Precision-Recall area under curve. Finally, present deep-learning model, Long Short-Term Memory network, that uses distinguish unimportant 92.57% accuracy.

参考文章(49)
Michael Bett, Alex Waibel, Ralph Gross, Jie Yang, Xiaojin Zhu, Hua Yu, Yue Pan, Multimodal meeting tracker riao conference. pp. 32- 45 ,(2000)
Shashank Agarwal, Hong Yu, Lisha Choubey, Automatically classifying the role of citations in biomedical articles. american medical informatics association annual symposium. ,vol. 2010, pp. 11- 15 ,(2010)
Mark Garzone, Robert E. Mercer, Towards an Automated Citation Classifier Lecture Notes in Computer Science. pp. 337- 346 ,(2000) , 10.1007/3-540-45486-1_28
Hidetsugu Nanba, Manabu Okumura, Towards Multi-paper Summarization Using Reference Information international joint conference on artificial intelligence. pp. 926- 931 ,(1999)
Laura Auria, R. A. Moro, Support Vector Machines (SVM) as a Technique for Solvency Analysis SSRN Electronic Journal. ,(2008) , 10.2139/SSRN.1424949
Ying Ding, Guo Zhang, Tamy Chambers, Min Song, Xiaolong Wang, Chengxiang Zhai, None, Content-Based Citation Analysis: The Next Generation of Citation Analysis Journal of the Association for Information Science and Technology. ,vol. 65, pp. 1820- 1833 ,(2014) , 10.1002/ASI.23256
H. Small, E. Greenlee, Citation context analysis of a co-citation cluster: Recombinant-DNA Scientometrics. ,vol. 2, pp. 277- 301 ,(1980) , 10.1007/BF02016349
Paul Zhang, Lavanya Koppaka, Semantics-based legal citation network international conference on artificial intelligence and law. pp. 123- 130 ,(2007) , 10.1145/1276318.1276342