A tree-based WQI modeling approach for integrating Web databases

Authors: Heidy M. Marin-Castro, Victor J. Sosa-Sosa, Ivan Lopez-Arevalo

Abstract: Every day, more and more specialized databases (car rental, hotels, airfares, etc.) are available on the Web and can only be queried by means of a Web Query Interface (WQI). Since the number of domain-specific Web databases is increasing, it is getting very complicated for end users to explore the information stored in them. In this context, research efforts have focused on building a single (unified) domain-specific WQI that allows a user to query and integrate different Web databases. The construction of such an integrated WQI, given a domain, involves several complex tasks, especially the extraction, representation, understanding, and mapping of the semantic content of each individual WQI associated with a Web database. Previous approaches have considered hierarchical models to build the integrated WQI, preserving the ancestor-descendant relationships of the WQIs. In this work, we propose a novel tree-based approach for automatically modeling the visual content of WQIs, representing their components in a clear and concise form. In the proposed approach, the Document Object Model (DOM) tree of each WQI in the integration process is processed as a resource to obtain relevant components such as fields (UIs), groups of UIs, and super-groups of UIs, as well as the corresponding labels. This process is guided by a set of 8 design heuristic rules for the right identification of labels and components. Experiments to evaluate the proposed strategy were conducted on the ICQ and Tel-8 datasets of the UIUC repository. Our results showed that the proposed approach has more than 94% accuracy, improving on currently reported results and making easier domain-specifi
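
The tree representation described in the abstract (fields, groups of fields, super-groups, and their labels) can be illustrated with a minimal sketch. This is not the authors' method and does not implement their 8 heuristic rules: it assumes the form uses explicit HTML markup (`<fieldset>`, `<legend>`, `<label for=...>`), which real WQIs often lack. All names here (`FormTreeBuilder`, `build_form_tree`, `SAMPLE`) are hypothetical and chosen only for illustration.

```python
from html.parser import HTMLParser

FIELD_TAGS = {"input", "select", "textarea"}


class FormTreeBuilder(HTMLParser):
    """Collects form fields, labels, and fieldset groups into a small tree.

    Assumption: grouping and labeling are explicit in the markup.
    """

    def __init__(self):
        super().__init__()
        self.root = {"kind": "form", "label": None, "children": []}
        self.stack = [self.root]   # currently open groups, innermost last
        self.labels = {}           # field id -> label text
        self._label_for = None     # id targeted by the currently open <label>
        self._in_legend = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "fieldset":
            # A nested fieldset makes its parent act as a super-group.
            group = {"kind": "group", "label": None, "children": []}
            self.stack[-1]["children"].append(group)
            self.stack.append(group)
        elif tag == "legend":
            self._in_legend = True
        elif tag == "label":
            self._label_for = attrs.get("for")
        elif tag in FIELD_TAGS and attrs.get("type") not in ("hidden", "submit"):
            field = {"kind": "field", "id": attrs.get("id"),
                     "name": attrs.get("name"), "label": None}
            self.stack[-1]["children"].append(field)

    def handle_endtag(self, tag):
        if tag == "fieldset" and len(self.stack) > 1:
            self.stack.pop()
        elif tag == "legend":
            self._in_legend = False
        elif tag == "label":
            self._label_for = None

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self._in_legend:
            self.stack[-1]["label"] = text          # legend labels the open group
        elif self._label_for is not None:
            self.labels[self._label_for] = text     # resolved to a field later


def attach_labels(node, labels):
    """Copies label text onto each field node by matching the field id."""
    if node["kind"] == "field":
        node["label"] = labels.get(node["id"])
    for child in node.get("children", []):
        attach_labels(child, labels)


def build_form_tree(html):
    builder = FormTreeBuilder()
    builder.feed(html)
    builder.close()
    attach_labels(builder.root, builder.labels)
    return builder.root


SAMPLE = """
<form>
  <fieldset>
    <legend>Pick-up</legend>
    <label for="city">City</label> <input id="city" name="city" type="text">
    <label for="date">Date</label> <input id="date" name="date" type="text">
  </fieldset>
  <label for="car">Car type</label>
  <select id="car" name="car"><option>Compact</option></select>
  <input type="submit" value="Search">
</form>
"""

tree = build_form_tree(SAMPLE)
```

Here `tree` holds one labeled group ("Pick-up") with two labeled fields, plus one ungrouped field. Interfaces that lay out labels purely by visual position would need heuristics such as those the abstract mentions, which this sketch does not attempt.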
