Abstract: Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway to encourage data sharing and reuse, from scientific publishers asking authors to submit data alongside manuscripts to data marketplaces, open data portals and data communities. Google recently beta-released a search service for datasets, which allows users to discover data stored in various online repositories via keyword queries. These developments foreshadow an emerging research field around dataset search or retrieval that broadly encompasses frameworks, methods and tools that help match a user data need against a collection of datasets. Here, we survey the state of the art of research and commercial systems and discuss what makes dataset search a field in its own right, with unique challenges and open questions. We look at approaches and implementations from related areas that dataset search is drawing upon, including information retrieval, databases, and entity-centric and tabular search, in order to identify possible paths to tackle these questions as well as immediate next steps that will take the field forward.
