An architecture for extracting information from hidden web databases using intelligent agent technology through reinforcement learning

Authors: Lohit Singh, Dilip Kumar Sharma

DOI: 10.1109/CICT.2013.6558108

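Abstract: The web contains an enormous amount of information. Of that information, only a small amount is visible to users, and a huge portion is not. This is because traditional search engines are not able to index or access all of it: only information that can be retrieved by following hypertext links is accessed by such engines. Information that is reached only through forms, which may include a login or authorization process, is not indexed. The hidden web refers to the part of the web that is not accessed by traditional web crawlers. An important problem in retrieving the desired, good-quality information from a hidden web database is how to find and identify the entry points of hidden web databases, i.e., forms, in the Web. Traditional crawlers may be unable to retrieve information from deep web databases, and this is the main motivation for retrieving information from the hidden web. Issues and challenges related to the hidden web are also discussed. An architecture for accessing hidden web databases that uses intelligent agent technology through reinforcement learning is proposed. The experimental results show that the proposed architecture helps in overcoming existing problems and outperforms existing approaches in terms of precision and recall.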

References (21)
Kathleen O'Connor, K O'Connor. Learning: An Introduction. (1968).
JRA McCallum, Jason Rennie. Using Reinforcement Learning to Spider the Web Efficiently. International Conference on Machine Learning, pp. 335-343 (1999).
Lu Jiang, Zhaohui Wu, Qian Feng, Jun Liu, Qinghua Zheng. Efficient deep web crawling using reinforcement learning. Knowledge Discovery and Data Mining, pp. 428-439 (2010). doi:10.1007/978-3-642-13657-3_46
Fidel Cacheda, Víctor Carneiro, Juan Raposo, Alberto Pan, Manuel Álvarez, Fernando Bellas. DeepBot: a focused crawler for accessing hidden web content. Electronic Commerce, pp. 18-25 (2007). doi:10.1145/1278380.1278385
Qiuyan Huang, Qingzhong Li, Hong Li, Zhongmin Yan. An Approach to Incremental Deep Web Crawling Based on Incremental Harvest Model. Procedia Engineering, vol. 29, pp. 1081-1087 (2012). doi:10.1016/J.PROENG.2012.01.093
Dilip Kumar Sharma, A. K. Sharma. A Novel Architecture for Deep Web Crawler. International Journal of Information Technology and Web Engineering, vol. 6, pp. 25-48 (2011). doi:10.4018/JITWE.2011010103
Dilip Kumar Sharma, A. K. Sharma. Deep Web Information Retrieval Process: A Technical Survey. International Journal of Information Technology and Web Engineering, vol. 5, pp. 1-22 (2010). doi:10.4018/JITWE.2010010101
Weicheng Ma, Xiuxia Chen, Wenqian Shang. Advanced Deep Web Crawler Based on Dom. Computational Sciences and Optimization, pp. 605-609 (2012). doi:10.1109/CSO.2012.138
Yanni Li, Yuping Wang, Jintao Du. E-FFC: an enhanced form-focused crawler for domain-specific deep web databases. Intelligent Information Systems, vol. 40, pp. 159-184 (2013). doi:10.1007/S10844-012-0221-8
Alexandros Ntoulas, Petros Zerfos, Junghoo Cho. Downloading textual hidden web content through keyword queries. ACM/IEEE Joint Conference on Digital Libraries, pp. 100-109 (2005). doi:10.1145/1065385.1065407