Abstract

The problem of deriving lower and upper bounds for the edit distance between undirected, labeled graphs has recently received increasing attention. However, only one algorithm has been proposed that allegedly computes not only an upper but also a lower bound for non-uniform edit costs and incorporates information about both node and edge labels. In this paper, we demonstrate that this algorithm is incorrect. We present a corrected version $\mathsf {B\scriptstyle{RANCH}}$ that runs in $\mathcal{O}(n^2\Delta ^3+n^3)$ time, where $\Delta$ is the maximum of the maximum degrees of input graphs $G$ and $H$ . We also develop a speed-up $\mathsf {B\scriptstyle{RANCH}}\mathsf{F\scriptstyle{AST}}$ that runs in $\mathcal{O}(n^2\Delta ^2+n^3)$ time and computes an only slightly less accurate lower bound. The lower bounds produced by $\mathsf {B\scriptstyle{RANCH}}$ and $\mathsf {B\scriptstyle{RANCH}}\mathsf{F\scriptstyle{AST}}$ are shown to be pseudo-metrics on a collection of graphs. Finally, we suggest an anytime algorithm $\mathsf {B\scriptstyle{RANCH}}\mathsf{T\scriptstyle{IGHT}}$ that iteratively improves $\mathsf {B\scriptstyle{RANCH}}$ ’s lower bound. $\mathsf {B\scriptstyle{RANCH}}\mathsf{T\scriptstyle{IGHT}}$ runs in $\mathcal{O}(n^3\Delta ^2+I(n^2\Delta ^3+n^3))$ time, where the number of iterations $I$ is controlled by the user. A detailed experimental evaluation shows that all suggested algorithms are Pareto optimal, that they are very effective when used as filters for edit distance range queries, and that they perform excellently when used within classification frameworks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.