Information Extraction and Database Techniques: A User-Oriented Approach to Querying the Web

作者: Zoé Lacroix , Arnaud Sahuguet , Raman Chandrasekar

DOI: 10.1007/BFB0054231

关键词:

摘要: We propose a novel approach to querying the Web with system named AKIRA (Agentive Knowledge-based Information Retrieval Architecture) which combines advanced technologies from and Extraction together Database techniques. The former enable access explicit as well implicit structure of documents organize them into hierarchy concepts metaconcepts; latter provide tools for data-manipulation. useroriented approach: given user's query, extracts target (structure expressed in query) uses standard retrieval techniques potentially relevant documents. content these is processed using extraction (along flexible agentive structure) filter relevance extract or matching structure. information garnered used populate smart-cache (an object-oriented database) whose schema inferred This smart-cache, thus defined posteriori, populated queried an expression PIQL, our query language. integrates complementary maximum flexibility user offer transparent

参考文章(26)
William A. Woods, Conceptual Indexing: A Better Way to Organize Knowledge Sun Microsystems, Inc.. ,(1997)
Victor Vianu, Serge Abiteboul, Richard Hull, Foundations of databases ,(1994)
R. Chandrasekar, B. Srinivas, Using syntactic information in document filtering: a comparative study of part-of-speech tagging and supertagging RIAO '97 Computer-Assisted Information Searching on Internet. pp. 531- 545 ,(1997)
Paolo Merialdo, Paolo Atzeni, Giansalvatore Mecca, To Weave the Web very large data bases. pp. 206- 215 ,(1997)
Zoé Lacroix, Claude Delobel, Philippe Brèche, Object Views and Database Restructuring database programming languages. pp. 180- 201 ,(1997) , 10.1007/3-540-64823-2_11
Cássio Souza Santos, Serge Abiteboul, Claude Delobel, Virtual schemas and bases extending database technology. pp. 81- 94 ,(1994) , 10.1007/3-540-57818-8_43
Ion Androutsopoulos, Peter Thanisch, Graeme D. Ritchie, A Framework for Natural Language Interfaces to Temporal Databases arXiv: Computation and Language. ,(1996)
Mary Fernandez, Daniela Florescu, Alon Levy, Dan Suciu, A query language for a Web-site management system international conference on management of data. ,vol. 26, pp. 4- 11 ,(1997) , 10.1145/262762.262763
Serge Abiteboul, Paris C. Kanellakis, Object identity as a query language primitive international conference on management of data. ,vol. 18, pp. 159- 173 ,(1989) , 10.1145/66926.66941
Serge Abiteboul, Sophie Cluet, Vassilis Christophides, Tova Milo, Guido Moerkotte, Jérôme Siméon, Querying Documents in Object Databases International Journal on Digital Libraries. ,vol. 1, pp. 5- 19 ,(1997) , 10.1007/S007990050001