A Discriminant Framework for Detecting Similar Scientific Research Projects Based on Big Data Mining

作者: Shanqing Li , Lirong Song , Hui Zhao

DOI: 10.1109/BIGDATA.CONGRESS.2014.75

关键词:

摘要: Scientific research projects play an important role in promoting the science and technology competitiveness of a country. Due to lack information open sharing, it is possible approve similar or duplicated by different government departments. In some way, these are waste scientific resources. To avoid such problem, this paper proposes discriminant framework for detecting based on big data mining technologies, providing evidence-based decision making departments during project approval process. Firstly, we construct file associated with officially approved projects, including titles, principal investigators, organizations, keywords, bibliographies published scholar papers. Secondly, proposed detect from above file. Finally, adopt Hadoop architecture speed up algorithm. demonstrate effectiveness feasibility framework, implement prototype system detection.

参考文章(4)
Jane Reid, Mounia Lalmas, Karen Finesilver, Morten Hertzum, Best entry points for structured document retrieval: part I: characteristics formal methods. ,vol. 42, pp. 74- 88 ,(2006) , 10.1016/J.IPM.2005.03.006
Haijun Zhang, Tommy W.S. Chow, A multi-level matching method with hybrid similarity for document retrieval Expert Systems With Applications. ,vol. 39, pp. 2710- 2719 ,(2012) , 10.1016/J.ESWA.2011.08.128
Pawel Jan Kalczynski, Amy Chou, Temporal document retrieval model for business news archives Information Processing and Management. ,vol. 41, pp. 635- 650 ,(2005) , 10.1016/J.IPM.2004.01.002
Karen Finesilver, Morten Hertzum, Jane Reid, Mounia Lalmas, Best entry points for structured document retrieval-Part II: Types, usage and effectiveness formal methods. ,vol. 42, pp. 89- 105 ,(2006) , 10.1016/J.IPM.2005.03.002