作者: Giovanni Tummarello , Renaud Delbru , Stéphane Campinas , Krisztian Balog , Diego Ceccarelli
DOI:
关键词:
摘要: The task of entity retrieval becomes increasingly prevalent as more and (semi-) structured information about objects is available on the Web in form documents embedding metadata (RDF, RDFa, Microformats, others). However, research development that direction dependent (1) availability a representative corpus entities are found Web, (2) an entity-oriented search infrastructure for experimenting with new models. In this paper, we introduce Sindice-2011 data collection which derived from collected by Sindice semantic engine. (available at http://data.sindice.com/trec2011/) especially designed supporting domain web retrieval. We describe how organised, discuss statistics collection, to foster development.