Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers

The concept of edit distance and its variants has applications in many areas such as computational linguistics, bioinformatics, and synchronization error detection in data communications. Here, we revisit the problem of computing the inner edit distance of a regular language given via a Nondetermini...

Full description

Bibliographic Details
Main Authors: Lila Kari, Stavros Konstantinidis, Steffen Kopecki, Meng Yang
Format: Article
Language:English
Published: MDPI AG 2018-10-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/11/11/165
_version_ 1811332686121271296
author Lila Kari
Stavros Konstantinidis
Steffen Kopecki
Meng Yang
author_facet Lila Kari
Stavros Konstantinidis
Steffen Kopecki
Meng Yang
author_sort Lila Kari
collection DOAJ
description The concept of edit distance and its variants has applications in many areas such as computational linguistics, bioinformatics, and synchronization error detection in data communications. Here, we revisit the problem of computing the inner edit distance of a regular language given via a Nondeterministic Finite Automaton (NFA). This problem relates to the inherent maximal error-detecting capability of the language in question. We present two efficient algorithms for solving this problem, both of which execute in time <inline-formula> <math display="inline"> <semantics> <mrow> <mi>O</mi> <mo stretchy="false">(</mo> <msup> <mi>r</mi> <mn>2</mn> </msup> <msup> <mi>n</mi> <mn>2</mn> </msup> <mi>d</mi> <mo stretchy="false">)</mo> </mrow> </semantics> </math> </inline-formula>, where <i>r</i> is the cardinality of the alphabet involved, <i>n</i> is the number of transitions in the given NFA, and <i>d</i> is the computed edit distance. We have implemented one of the two algorithms and present here a set of performance tests. The correctness of the algorithms is based on the connection between word distances and error detection and the fact that nondeterministic transducers can be used to represent the errors (resp., edit operations) involved in error-detection (resp., in word distances).
first_indexed 2024-04-13T16:40:42Z
format Article
id doaj.art-28377fedee3d48e3aa1831e44380f44c
institution Directory Open Access Journal
issn 1999-4893
language English
last_indexed 2024-04-13T16:40:42Z
publishDate 2018-10-01
publisher MDPI AG
record_format Article
series Algorithms
spelling doaj.art-28377fedee3d48e3aa1831e44380f44c2022-12-22T02:39:14ZengMDPI AGAlgorithms1999-48932018-10-01111116510.3390/a11110165a11110165Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via TransducersLila Kari0Stavros Konstantinidis1Steffen Kopecki2Meng Yang3School of Computer Science, University of Waterloo, Waterloo, ON N2L 3G1, CanadaDepartment of Mathematics and Computing Science, Saint Mary’s University, Halifax, NS B3H 3C3, CanadaDepartment of Mathematics and Computing Science, Saint Mary’s University, Halifax, NS B3H 3C3, CanadaDepartment of Mathematics and Computing Science, Saint Mary’s University, Halifax, NS B3H 3C3, CanadaThe concept of edit distance and its variants has applications in many areas such as computational linguistics, bioinformatics, and synchronization error detection in data communications. Here, we revisit the problem of computing the inner edit distance of a regular language given via a Nondeterministic Finite Automaton (NFA). This problem relates to the inherent maximal error-detecting capability of the language in question. We present two efficient algorithms for solving this problem, both of which execute in time <inline-formula> <math display="inline"> <semantics> <mrow> <mi>O</mi> <mo stretchy="false">(</mo> <msup> <mi>r</mi> <mn>2</mn> </msup> <msup> <mi>n</mi> <mn>2</mn> </msup> <mi>d</mi> <mo stretchy="false">)</mo> </mrow> </semantics> </math> </inline-formula>, where <i>r</i> is the cardinality of the alphabet involved, <i>n</i> is the number of transitions in the given NFA, and <i>d</i> is the computed edit distance. We have implemented one of the two algorithms and present here a set of performance tests. The correctness of the algorithms is based on the connection between word distances and error detection and the fact that nondeterministic transducers can be used to represent the errors (resp., edit operations) involved in error-detection (resp., in word distances).https://www.mdpi.com/1999-4893/11/11/165algorithmsautomatacomplexityedit distanceimplementationtransducersregular language
spellingShingle Lila Kari
Stavros Konstantinidis
Steffen Kopecki
Meng Yang
Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers
Algorithms
algorithms
automata
complexity
edit distance
implementation
transducers
regular language
title Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers
title_full Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers
title_fullStr Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers
title_full_unstemmed Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers
title_short Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers
title_sort efficient algorithms for computing the inner edit distance of a regular language via transducers
topic algorithms
automata
complexity
edit distance
implementation
transducers
regular language
url https://www.mdpi.com/1999-4893/11/11/165
work_keys_str_mv AT lilakari efficientalgorithmsforcomputingtheinnereditdistanceofaregularlanguageviatransducers
AT stavroskonstantinidis efficientalgorithmsforcomputingtheinnereditdistanceofaregularlanguageviatransducers
AT steffenkopecki efficientalgorithmsforcomputingtheinnereditdistanceofaregularlanguageviatransducers
AT mengyang efficientalgorithmsforcomputingtheinnereditdistanceofaregularlanguageviatransducers