Effective Focused Crawling Based on Content and Link Structure Analysis

Authors: Deepak Singh Tomar, Anshika Pal, S. C. Shrivastava

DOI:

Keywords:

Abstract: … This paper presents a method for focused crawling that allows the crawler to pass through several irrelevant pages to reach the next relevant one when the current page is irrelevant. …

References (8)
Jason Rennie, Andrew McCallum. Using Reinforcement Learning to Spider the Web Efficiently. International Conference on Machine Learning, pp. 335-343 (1999).
Soumen Chakrabarti, Martin van den Berg, Byron Dom. Focused crawling: a new approach to topic-specific Web resource discovery. The Web Conference, vol. 31, pp. 1623-1640 (1999). DOI: 10.1016/S1389-1286(99)00052-3
M. Shokouhi, P. Chubak, Z. Raeesy. Enhancing focused crawling with genetic algorithms. International Conference on Information Technology: Coding and Computing, vol. 2, pp. 503-508 (2005). DOI: 10.1109/ITCC.2005.145
Junghoo Cho, Hector Garcia-Molina, Lawrence Page. Efficient crawling through URL ordering. The Web Conference, vol. 30, pp. 161-172 (1998). DOI: 10.1016/S0169-7552(98)00108-1
P.M.E. De Bra, R.D.J. Post. Information retrieval in the World-Wide Web: making client-based searching feasible. The Web Conference, vol. 27, pp. 183-192 (1994). DOI: 10.1016/0169-7552(94)90132-5
Yulian Zhang, Chunxia Yin, Fuyong Yuan. An Application of Improved PageRank in Focused Crawler. Fuzzy Systems and Knowledge Discovery, vol. 2, pp. 331-335 (2007). DOI: 10.1109/FSKD.2007.142
Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, vol. 46, pp. 604-632 (1999). DOI: 10.1145/324133.324140
Debajyoti Mukhopadhyay, Sukanta Sinha, Arup Biswas. A New Approach to Design Domain Specific Ontology Based Web Crawler. Journal of Computing and Information Technology (2007).