Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers

The concept of edit distance and its variants has applications in many areas such as computational linguistics, bioinformatics, and synchronization error detection in data communications. Here, we revisit the problem of computing the inner edit distance of a regular language given via a Nondetermini...

Full description

Bibliographic Details
Main Authors: Lila Kari, Stavros Konstantinidis, Steffen Kopecki, Meng Yang
Format: Article
Language:English
Published: MDPI AG 2018-10-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/11/11/165
Description
Summary:The concept of edit distance and its variants has applications in many areas such as computational linguistics, bioinformatics, and synchronization error detection in data communications. Here, we revisit the problem of computing the inner edit distance of a regular language given via a Nondeterministic Finite Automaton (NFA). This problem relates to the inherent maximal error-detecting capability of the language in question. We present two efficient algorithms for solving this problem, both of which execute in time <inline-formula> <math display="inline"> <semantics> <mrow> <mi>O</mi> <mo stretchy="false">(</mo> <msup> <mi>r</mi> <mn>2</mn> </msup> <msup> <mi>n</mi> <mn>2</mn> </msup> <mi>d</mi> <mo stretchy="false">)</mo> </mrow> </semantics> </math> </inline-formula>, where <i>r</i> is the cardinality of the alphabet involved, <i>n</i> is the number of transitions in the given NFA, and <i>d</i> is the computed edit distance. We have implemented one of the two algorithms and present here a set of performance tests. The correctness of the algorithms is based on the connection between word distances and error detection and the fact that nondeterministic transducers can be used to represent the errors (resp., edit operations) involved in error-detection (resp., in word distances).
ISSN:1999-4893