作者: Bo Liu , Zhenxing Li
DOI: 10.1109/FSKD.2015.7382138
关键词:
摘要: For the purpose of obtaining deep web query interface from forms accurately, this paper proposes a framework automatic discovery, which includes procedures collecting pages, extracting and features, filtering forms, identifying forms. A heuristic rule-based k-nearest neighbor algorithm for interfaces is introduced. In experiments, number non-query different domains are selected classifying interfaces. Experimental results demonstrate that presented can significantly improve accuracy discovery.