Dataset search: a survey

Authors: Elena Simperl, Paul Groth, Adriane Chapman, Emilia Kacprzak, Laura Koesten

DOI:

Keywords:

Abstract: Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway to encourage data sharing and reuse, from scientific publishers asking authors to submit data alongside manuscripts to data marketplaces, open data portals and data communities. Google recently beta-released a search service for datasets, which allows users to discover data stored in various online repositories via keyword queries. These developments foreshadow an emerging research field around dataset search or retrieval that broadly encompasses frameworks, methods and tools that help match a user data need against a collection of datasets. Here, we survey the state of the art of research and commercial systems in dataset retrieval. We identify what makes dataset search a research field in its own right, with unique challenges and methods, and highlight open problems. We look at approaches and implementations from related areas that dataset search is drawing upon, including information retrieval, databases, and entity-centric and tabular search, in order to identify possible paths to resolve these open problems as well as immediate next steps that will take the field forward.
