Summary: | The concept of edit distance and its variants has applications in many areas such as computational linguistics, bioinformatics, and synchronization error detection in data communications. Here, we revisit the problem of computing the inner edit distance of a regular language given via a Nondeterministic Finite Automaton (NFA). This problem relates to the inherent maximal error-detecting capability of the language in question. We present two efficient algorithms for solving this problem, both of which execute in time <inline-formula> <math display="inline"> <semantics> <mrow> <mi>O</mi> <mo stretchy="false">(</mo> <msup> <mi>r</mi> <mn>2</mn> </msup> <msup> <mi>n</mi> <mn>2</mn> </msup> <mi>d</mi> <mo stretchy="false">)</mo> </mrow> </semantics> </math> </inline-formula>, where <i>r</i> is the cardinality of the alphabet involved, <i>n</i> is the number of transitions in the given NFA, and <i>d</i> is the computed edit distance. We have implemented one of the two algorithms and present here a set of performance tests. The correctness of the algorithms is based on the connection between word distances and error detection and the fact that nondeterministic transducers can be used to represent the errors (resp., edit operations) involved in error-detection (resp., in word distances).
|