RTED

作者: Nikolaus Augsten , Mateusz Pawlik

DOI: 10.14778/2095686.2095692

关键词:

摘要: We consider the classical tree edit distance between ordered labeled trees, which is defined as minimum-cost sequence of node operations that transform one into another. The state-of-the-art solutions for are not satisfactory. main competitors in field either have optimal worst-case complexity, but worst case happens frequently, or they very efficient some shapes, degenerate others. This leads to unpredictable and often infeasible runtimes. There no obvious way choose algorithms.In this paper we present RTED, a robust algorithm. asymptotic complexity RTED smaller equal best any input instance, i.e., both optimal. introduce class LRH (Left-Right-Heavy) algorithms, includes fastest algorithms presented literature. prove outperforms all previously proposed terms runtime complexity. In our experiments on synthetic real world data empirically evaluate solution compare it state-of-the-art.

参考文章(33)
Zhiwei Lin, Hui Wang, Sally McClean, Measuring Tree Similarity for Natural Language Processing Based Information Retrieval Natural Language Processing and Information Systems. pp. 13- 23 ,(2010) , 10.1007/978-3-642-13881-2_2
Shihyen Chen, Kaizhong Zhang, An improved algorithm for tree edit distance incorporating structural linearity computing and combinatorics conference. pp. 482- 492 ,(2007) , 10.1007/978-3-540-73545-8_47
Amaury Habrard, José Manuel Iñesta, David Rizo, Marc Sebban, Melody Recognition with Learned Edit Distances SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition. pp. 86- 96 ,(2008) , 10.1007/978-3-540-89689-0_13
Philip N. Klein, Computing the Edit-Distance between Unrooted Ordered Trees european symposium on algorithms. pp. 91- 102 ,(1998) , 10.1007/3-540-68530-8_8
J. Bellando, R. Kothari, Region-based modeling and tree edit distance as a basis for gesture recognition international conference on image analysis and processing. pp. 698- 703 ,(1999) , 10.1109/ICIAP.1999.797676
Tatsuya AKUTSU, Tree Edit Distance Problems: Algorithms and Applications to Bioinformatics IEICE Transactions on Information and Systems. ,vol. 93, pp. 208- 218 ,(2010) , 10.1587/TRANSINF.E93.D.208
Kuo-Chung Tai, The Tree-to-Tree Correction Problem Journal of the ACM. ,vol. 26, pp. 422- 433 ,(1979) , 10.1145/322139.322143
Kaizhong Zhang, Dennis Shasha, Simple fast algorithms for the editing distance between trees and related problems SIAM Journal on Computing. ,vol. 18, pp. 1245- 1262 ,(1989) , 10.1137/0218082
Philip Bille, A survey on tree edit distance and related problems Theoretical Computer Science. ,vol. 337, pp. 217- 239 ,(2005) , 10.1016/J.TCS.2004.12.030
Nikolaus Augsten, Michael Bohlen, Curtis Dyreson, Johann Gamper, Approximate Joins for Data-Centric XML 2008 IEEE 24th International Conference on Data Engineering. pp. 814- 823 ,(2008) , 10.1109/ICDE.2008.4497490