XWRAP: an XML-enabled wrapper construction system for Web information sources

Authors: L. Liu, C. Pu, W. Han

DOI: 10.1109/ICDE.2000.839475

Keywords: XML, Metadata, Information extraction, Construct (python library), Computer science, Programming language, Executable, User interface, Information retrieval, Code generation, Web page

Abstract: The paper describes the methodology and the software development of XWRAP, an XML-enabled wrapper construction system for semi-automatic generation of wrapper programs. By …

References (13)
Nicholas Kushmerick, Daniel S. Weld. Wrapper induction for information extraction. International Joint Conference on Artificial Intelligence, pp. 729-737 (1997).
Shankar Pal, David Shutt, Thomas Bergstraesser, Philip A. Bernstein. Versions and workspaces in Microsoft repository. ACM SIGMOD Record, vol. 28, pp. 532-533 (1999). DOI: 10.1145/304181.304248
Philip A. Bernstein, Thomas Bergstraesser, Jason Carlson, Shankar Pal, Paul Sanders, David Shutt. Microsoft repository version 2 and the open information model. Information Systems, vol. 24, pp. 71-98 (1999). DOI: 10.1016/S0306-4379(99)00006-X
Paolo Atzeni, Giansalvatore Mecca. Cut and paste. Symposium on Principles of Database Systems, vol. 58, pp. 144-153 (1997). DOI: 10.1145/263661.263678
Ling Liu, C. Pu, Wei Tang. Continual queries for Internet scale event-driven information delivery. IEEE Transactions on Knowledge and Data Engineering, vol. 11, pp. 610-628 (1999). DOI: 10.1109/69.790816
N. Ashish, C.A. Knoblock. Semi-automatic wrapper generation for Internet information sources. Cooperative Information Systems, pp. 160-169 (1997). DOI: 10.1109/COOPIS.1997.613813
Stephen Soderland. Learning to extract text-based information from the World Wide Web. Knowledge Discovery and Data Mining, pp. 251-254 (1997).
Brad Adelberg. NoDoSE---a tool for semi-automatically extracting structured and semistructured data from text documents. Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98, vol. 27, pp. 283-294 (1998). DOI: 10.1145/276304.276330
Joachim Hammer, Héctor García-Molina, Svetlozar Nestorov, Ramana Yerneni, Marcus Breunig, Vasilis Vassalos. Template-based wrappers in the TSIMMIS system. International Conference on Management of Data, vol. 26, pp. 532-535 (1997). DOI: 10.1145/253260.253395
Arnaud Sahuguet, Fabien Azavant. WysiWyg Web Wrapper Factory (W4F) (1999).