Similarity of DTDs Based on Edit Distance and Semantics

作者: Aleš Wojnar , Irena Mlýnková , Jiří Dokulil , None

DOI: 10.1007/978-3-540-85257-5_21

关键词:

摘要: In this paper we propose a technique for evaluating similarity of XML schema fragments. Contrary to existing works focus on structural level in combination with semantic the data. For purpose exploit idea edit distance utilized constructs DTDs which enables express differences given data more precisely. addition, it provides realistic results. Using various experiments show behavior and advantages proposed approach.

参考文章(12)
Irena Mlynkova, Jaroslav Pokorný, Kamil Toman, Statistical Analysis of Real XML Data Collections conference on management of data. pp. 15- 26 ,(2006)
Patrick K. L. Ng, Vincent T. Y. Ng, Structural similarity between XML documents and DTDs international conference on computational science. pp. 412- 421 ,(2003) , 10.1007/3-540-44863-2_41
H. V. Jagadish, Andrew Nierman, Evaluating Structural Similarity in XML Documents international workshop on the web and databases. pp. 61- 66 ,(2002)
V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals Soviet physics. Doklady. ,vol. 10, pp. 707- 710 ,(1966)
Rong Li, Zhongping Zhang, Shunliang Cao, Yangyong Zhu, Similarity Metric for XML Documents ,(2003)
Mehmet Altinel, Michael J. Franklin, Efficient Filtering of XML Documents for Selective Dissemination of Information very large data bases. pp. 53- 64 ,(2000)
Tim Bray, Jean Paoli, C. M. Sperberg-McQueen, Extensible Markup Language (XML). World Wide Web. ,vol. 2, pp. 27- 66 ,(1997)
Elisa Bertino, Giovanna Guerrini, Marco Mesiti, A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications Information Systems. ,vol. 29, pp. 23- 46 ,(2004) , 10.1016/S0306-4379(03)00031-0
Tova Milo, Sagit Zohar, Using Schema Matching to Simplify Heterogeneous Data Translation very large data bases. pp. 122- 133 ,(1998)
Hong-Hai Do, Erhard Rahm, COMA: a system for flexible combination of schema matching approaches very large data bases. pp. 610- 621 ,(2002) , 10.1016/B978-155860869-6/50060-3