VR-Tree: A novel tree-based approach for modeling Web Query Interfaces

作者: Heidy M. Marin-Castro , Victor J. Sosa Sosa

DOI: 10.1007/S10844-017-0449-4

关键词: Degree of precisionTree basedSchema (psychology)Deep WebWeb browserComputer scienceWeb search queryData mining

摘要: Web Query Interfaces (WQIs) play a very important role in retrieving Deep content. WQIs allow users to query domain-specific databases for obtaining information of interest from diverse domains such as car rentals, hotels, airfare, etc. As the number on web is increasing drastically, some research efforts are focused building single (unified) WQI that allows and integrate available different related specific domain. A task this WQIs’ integration process extraction, modeling understanding semantic However, challenging because great heterogeneity design WQIs. This paper presents novel tree-based approach tree schema called Visual Reduced Tree (VR-Tree) built produced by browser’s render engine, applying set well- defined functions guided heuristic rules identify WQI’s main components their relationships. The proposed strategy was evaluated running collection experiments over Tel-8 ICQ datasets UIUC repository. results show automatic possible with high degree precision if compared against previous approaches, simplifying only considering visual spatial properties using VR-Tree work.

参考文章(16)
Wensheng Wu, AnHai Doan, Clement Yu, Weiyi Meng, Modeling and Extracting Deep-Web Query Interfaces Advances in Information and Intelligent Systems. pp. 65- 90 ,(2009) , 10.1007/978-3-642-04141-9_4
Hai He, Weiyi Meng, Clement Yu, Zonghuan Wu, Constructing interface schemas for search interfaces of web databases web information systems engineering. pp. 29- 42 ,(2005) , 10.1007/11581062_3
Radhouane Boughammoura, Lobna Hlaoua, Mohamed Nazih Omri, VIQI: A new approach for visual interpretation of deep web query interfaces international conference on information technology. ,vol. 1, pp. 1- 6 ,(2012) , 10.1109/ICITES.2012.6216656
Tim Furche, Georg Gottlob, Giovanni Grasso, Xiaonan Guo, Giorgio Orsi, Christian Schallhart, OPAL: automated form understanding for the deep web the web conference. pp. 829- 838 ,(2012) , 10.1145/2187836.2187948
Heidy M. Marin-Castro, Victor J. Sosa-Sosa, Jose F. Martinez-Trinidad, Ivan Lopez-Arevalo, Automatic discovery of Web Query Interfaces using machine learning techniques intelligent information systems. ,vol. 40, pp. 85- 108 ,(2013) , 10.1007/S10844-012-0217-4
Hai He, Weiyi Meng, Clement Yu, Zonghuan Wu, Automatic integration of Web search interfaces with WISE-Integrator very large data bases. ,vol. 13, pp. 256- 273 ,(2004) , 10.1007/S00778-004-0126-4
Oliver Kaljuvee, Orkut Buyukkokten, Hector Garcia-Molina, Andreas Paepcke, Efficient Web form entry on PDAs Proceedings of the tenth international conference on World Wide Web - WWW '01. pp. 663- 672 ,(2001) , 10.1145/371920.372180
Nicholas Kushmerick, Learning to invoke Web forms Lecture Notes in Computer Science. pp. 997- 1013 ,(2003) , 10.1007/978-3-540-39964-3_63
Thanh Nguyen, Juliana Freire, Learning to extract form labels very large data bases. ,vol. 1, pp. 684- 694 ,(2008) , 10.14778/1453856.1453931
Ritu Khare, Yuan An, An empirical study on using hidden markov model for search interface segmentation conference on information and knowledge management. pp. 17- 26 ,(2009) , 10.1145/1645953.1645959