Polylogarithmic Approximation for Edit Distance and the Asymmetric Query Complexity

作者: Alexandr Andoni , Robert Krauthgamer , Krzysztof Onak

DOI: 10.1109/FOCS.2010.43

关键词:

摘要: We present a near-linear time algorithm that approximates the edit distance between two strings within polylogarithmic factor. For of length $n$ and every fixed $\eps>0$, computes $(\log n)^{O(1/\eps)}$ approximation in $n^{1+\eps}$ time. This is an {\em exponential} improvement over previously known factor, $2^{\tilde O(\sqrt{\log n})}$, with comparable running [Ostrovsky Rabani, J. ACM 2007, Andoni Onak, STOC 2009]. result arises naturally study new \emph{asymmetric query} model. In this model, input consists $x$ $y$, can access $y$ unrestricted manner, while being charged for querying symbol $x$. Indeed, we obtain our main by designing makes small number queries then provide nearly-matching lower bound on queries. Our first to expose hardness stemming from ``repetitive'', which means many their substrings are approximately identical. Consequently, provides rigorous separation Ulam distance.

参考文章(41)
Thomas H. Cormen, Introduction to algorithms [2nd ed.] MIT Press. ,(2001)
TH Cormen, RL Rivest, CE Leiserson, C Stein, Introduction to Algorithms, 2nd edition. ,(2001)
Graham Cormode, Sequence distance embeddings Department of Computer Science. ,(2003)
Piotr Indyk, Ji_í Matou_ek, Low-Distortion Embeddings of Finite Metric Spaces Handbook of Discrete and Computational Geometry, Second Edition. pp. 177- 196 ,(2004) , 10.1201/9781420035315.CH8
P. Indyk, Algorithmic applications of low-distortion geometric embeddings international conference on cluster computing. pp. 10- 33 ,(2001) , 10.1109/SFCS.2001.959878
V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals Soviet physics. Doklady. ,vol. 10, pp. 707- 710 ,(1966)
Rafail Ostrovsky, Yuval Rabani, Low distortion embeddings for edit distance Journal of the ACM. ,vol. 54, pp. 23- ,(2007) , 10.1145/1284320.1284322
Alexandr Andoni, Krzysztof Onak, Approximating edit distance in near-linear time Proceedings of the 41st annual ACM symposium on Symposium on theory of computing - STOC '09. pp. 199- 204 ,(2009) , 10.1145/1536414.1536444
Robert A. Wagner, Michael J. Fischer, The String-to-String Correction Problem Journal of the ACM. ,vol. 21, pp. 168- 173 ,(1974) , 10.1145/321796.321811