作者: Yanbo Ru , Ellis Horowitz
DOI: 10.1108/14684520510607579
关键词:
摘要: Purpose – The existence and continued growth of the invisible web creates a major challenge for search engines that are attempting to organize all material on into form is easily retrieved by users. purpose this paper identify challenges problems underlying existing work in area.Design/methodology/approach A discussion based short survey prior work, including automated discovery site interfaces, classification sites, label assignment filling, information extraction from resulting pages, learning query language interface, building content summary an site, selecting proper databases, integrating web‐search accessing performance site.Findings Existing technologies tools indexing follow one two strategies: interface or examining portion con...