Ontology-Based Information Extraction from the World Wide Web

作者: Jan Korst , Gijs Geleijnse , Nick de Jong , Michael Verschoor

DOI: 10.1007/1-4020-4995-1_10

关键词:

摘要: We study possibilities to automatically extract information from the Internet, by structuring and combining data web pages. The pages are found with use of a search engine is structured using ontologies. ontologies populated statistical linguistic techniques. present results case that aimed at finding names famous persons. indicate that, even if we only summaries Google provides pages, approach in high precision recall for specific application.

参考文章(13)
Katerina T. Frantzi, Jun'ichi Tsujii, Sophia Ananiadou, Classifying Technical Terms international conference on electronic publishing. ,(1999)
Jeroen Breebaart, Martin F. McKinney, Features for Audio Classification Philips Research. pp. 113- 129 ,(2004) , 10.1007/978-94-017-0703-9_6
Gijs Geleijnse, Jan H. M. Korst, Automatic Ontology Population by Googling. belgium-netherlands conference on artificial intelligence. pp. 120- 126 ,(2005)
Philip Stokes, Philosophy 100 Essential Thinkers ,(2003)
Sergey Brin, Extracting Patterns and Relations from the World Wide Web Lecture Notes in Computer Science. pp. 172- 183 ,(1999) , 10.1007/10704656_11
Andreas Faatz, Ralf Steinmetz, Ontology enrichment with texts from the WWW ,(2002)
William B. Frakes, Ricardo Baeza-Yates, Information Retrieval: Data Structures and Algorithms ,(1992)
Cody C. T. Kwok, Oren Etzioni, Daniel S. Weld, Scaling question answering to the Web Proceedings of the tenth international conference on World Wide Web - WWW '01. pp. 150- 161 ,(2001) , 10.1145/371920.371973