Evaluation of unique identifiers used as keys to match identical publications in Pure and SciVal - a case study from health science.

作者: Heidi Holst Madsen , Dicte Madsen , Marianne Gauffriau

DOI: 10.12688/F1000RESEARCH.8913.2

关键词:

摘要: Unique identifiers (UID) are seen as an effective key to match identical publications across databases or identify duplicates in a database. The objective of the present study is investigate how well UIDs work keys integration between Pure and SciVal, based on case with from health sciences. We evaluate matching process information about coverage, precision, characteristics matched versus not keys. analyze this detect errors, if any, process. As example we also briefly discuss publication sets formed by using may affect bibliometric indicators number publications, citations, average citations per publication.  The addressed literature review study. shows that only few studies key. From four error types: Duplicate digital object (DOI), incorrect DOIs reference lists databases, registered database where analysis performed, erroneous optical special character recognition. explores use SciVal. Specifically journal English two databases. find all types except recognition our sets. In particular duplicate constitute problem for calculation both keeping improve reliability citation counts deleting them will distort publication. linking implemented many settings, availability become critical inclusion analysis.

参考文章(17)
James M. Ostell, Sarah J. Wheelan, Jonathan A. Kans, The NCBI Data Model Methods of Biochemical Analysis. ,vol. 43, pp. 19- 43 ,(2001) , 10.1002/0471223921.CH2
Philippe Mongeon, Adèle Paul-Hus, None, The journal coverage of Web of Science and Scopus: a comparative analysis Scientometrics. ,vol. 106, pp. 213- 228 ,(2016) , 10.1007/S11192-015-1765-5
Marlies Olensky, Marion Schmidt, Nees Jan van Eck, None, Evaluation of the citation matching algorithms of CWTS and iFQ in comparison to the Web of science association for information science and technology. ,vol. 67, pp. 2550- 2564 ,(2016) , 10.1002/ASI.23590
Stefanie Haustein, Tobias Siebenlist, Applying social bookmarking data to evaluate journal usage Journal of Informetrics. ,vol. 5, pp. 446- 457 ,(2011) , 10.1016/J.JOI.2011.04.002
Kwang-Young Kim, Hwan-Min Kim, A Study on Developing and Refining a Large Citation Service System International Journal of Knowledge Content Development and Technology. ,vol. 3, pp. 65- 80 ,(2013) , 10.5865/IJKCT.2013.3.1.065
James A. Hammerton, Michael Granitzer, Dan Harvey, Maya Hristakeva, Kris Jack, On generating large-scale ground truth datasets for the deduplication of bibliographic records web intelligence, mining and semantics. pp. 18- ,(2012) , 10.1145/2254129.2254153
Yu Jiang, Can Lin, Weiyi Meng, Clement Yu, Aaron M. Cohen, Neil R. Smalheiser, Rule-based deduplication of article records from bibliographic databases. Database. ,vol. 2014, ,(2014) , 10.1093/DATABASE/BAT086
Fiorenzo Franceschini, Domenico Maisano, Luca Mastrogiacomo, A Novel Approach for Estimating the Omitted-Citation Rate of Bibliometric Databases With an Application to the Field of Bibliometrics Journal of the Association for Information Science and Technology. ,vol. 64, pp. 2149- 2156 ,(2013) , 10.1002/ASI.22898