Data.dcs:converting legacy data into linked data

作者: Matthew Rowe

DOI:

关键词: Web siteLegacy dataRSSCoreferenceData scienceResearch groupsLinked dataResolution (logic)Computer scienceWorld Wide Web

摘要: Data.dcs is a project intended to produce Linked Data describing the University of Sheffield’s Department Computer Science. At present department’s web site contains important legacy data people, publications and research groups. This distributed provided in heterogeneous formats (e.g. HTML documents, RSS feeds), making it hard for machines make sense such query it. paper presents an approach convert from its current form into machine-readable representation which linked theWeb Data. The describes triplification data, coreference resolution interlinking with external datasets.

参考文章(16)
Michael J. Cafarella, Oren Etzioni, Stephen Soderland, Michele Banko, Matt Broadhead, Open information extraction from the web international joint conference on artificial intelligence. pp. 2670- 2676 ,(2007)
Fabio Ciravegna, Sam Chapman, Alexiei Dingli, Yorick Wilks, Learning to harvest information for the semantic web Lecture Notes in Computer Science. pp. 312- 326 ,(2004) , 10.1007/978-3-540-25956-5_22
Knud Möller, Tom Heath, Siegfried Handschuh, John Domingue, Recipes for semantic web dog food: the ESWC and ISWC metadata projects international semantic web conference. ,vol. 4825, pp. 802- 815 ,(2007) , 10.1007/978-3-540-76298-0_58
Erik Hetzner, A simple method for citation metadata extraction using hidden markov models Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries - JCDL '08. pp. 280- 284 ,(2008) , 10.1145/1378889.1378937
Andrew Kachites McCallum, Dayne Freitag, Information Extraction with HMMs and Shrinkage ,(1999)
George R. Thoma, Jie Zou, Daniel Le, Structure and content analysis for html medical articles Proceedings of the 2007 ACM symposium on Document engineering - DocEng '07. pp. 199- 201 ,(2007) , 10.1145/1284420.1284468
Danushka Bollegala, Yutaka Matsuo, MitsuruIshizuka, Keigo Watanabe, A Two-Step Approach to Extracting Attributes for People on the Web ,(2009)
Ian Millard, Hugh Glaser, Afraz Jaffri, URI Disambiguation in the Context of Linked Data LDOW. ,(2008)
Richard Cyganiak, Jun Zhao, Michael Hausenblas, Keith Alexander, Describing Linked Datasets On the Design and Usage of voiD, the "Vocabulary Of Interlinked Datasets" ,(2009)
Xiaojun Wan, Jianfeng Gao, Mu Li, Binggong Ding, Person resolution in person search results Proceedings of the 14th ACM international conference on Information and knowledge management - CIKM '05. pp. 163- 170 ,(2005) , 10.1145/1099554.1099585