Parsing Expression Grammars and Their Induction Algorithm

Grammatical inference (GI), i.e., the task of finding a rule that lies behind given words, can be used in the analyses of amyloidogenic sequence fragments, which are essential in studies of neurodegenerative diseases. In this paper, we developed a new method that generates non-circular parsing expre...

Full description

Bibliographic Details
Main Authors: Wojciech Wieczorek, Olgierd Unold, Łukasz Strąk
Format: Article
Language:English
Published: MDPI AG 2020-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/23/8747
_version_ 1797545519440986112
author Wojciech Wieczorek
Olgierd Unold
Łukasz Strąk
author_facet Wojciech Wieczorek
Olgierd Unold
Łukasz Strąk
author_sort Wojciech Wieczorek
collection DOAJ
description Grammatical inference (GI), i.e., the task of finding a rule that lies behind given words, can be used in the analyses of amyloidogenic sequence fragments, which are essential in studies of neurodegenerative diseases. In this paper, we developed a new method that generates non-circular parsing expression grammars (PEGs) and compares it with other GI algorithms on the sequences from a real dataset. The main contribution of this paper is a genetic programming-based algorithm for the induction of parsing expression grammars from a finite sample. The induction method has been tested on a real bioinformatics dataset and its classification performance has been compared to the achievements of existing grammatical inference methods. The evaluation of the generated PEG on an amyloidogenic dataset revealed its accuracy when predicting amyloid segments. We show that the new grammatical inference algorithm achieves the best ACC (Accuracy), AUC (Area under ROC curve), and MCC (Mathew’s correlation coefficient) scores in comparison to five other automata or grammar learning methods.
first_indexed 2024-03-10T14:16:37Z
format Article
id doaj.art-79989ce4ee7241fb924ca0a6e4bab5b1
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T14:16:37Z
publishDate 2020-12-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-79989ce4ee7241fb924ca0a6e4bab5b12023-11-20T23:44:42ZengMDPI AGApplied Sciences2076-34172020-12-011023874710.3390/app10238747Parsing Expression Grammars and Their Induction AlgorithmWojciech Wieczorek0Olgierd Unold1Łukasz Strąk2Department of Computer Science and Automatics, University of Bielsko-Biala, Willowa 2, 43-309 Bielsko-Biala, PolandDepartment of Computer Engineering, Wroclaw University of Science and Technology, Wyb. Wyspianskiego 27, 50-370 Wroclaw, PolandFaculty of Science and Technology, University of Silesia in Katowice, Bankowa 14, 40-007 Katowice, PolandGrammatical inference (GI), i.e., the task of finding a rule that lies behind given words, can be used in the analyses of amyloidogenic sequence fragments, which are essential in studies of neurodegenerative diseases. In this paper, we developed a new method that generates non-circular parsing expression grammars (PEGs) and compares it with other GI algorithms on the sequences from a real dataset. The main contribution of this paper is a genetic programming-based algorithm for the induction of parsing expression grammars from a finite sample. The induction method has been tested on a real bioinformatics dataset and its classification performance has been compared to the achievements of existing grammatical inference methods. The evaluation of the generated PEG on an amyloidogenic dataset revealed its accuracy when predicting amyloid segments. We show that the new grammatical inference algorithm achieves the best ACC (Accuracy), AUC (Area under ROC curve), and MCC (Mathew’s correlation coefficient) scores in comparison to five other automata or grammar learning methods.https://www.mdpi.com/2076-3417/10/23/8747classificationgenetic programminggrammatical inferenceparsing expression grammar
spellingShingle Wojciech Wieczorek
Olgierd Unold
Łukasz Strąk
Parsing Expression Grammars and Their Induction Algorithm
Applied Sciences
classification
genetic programming
grammatical inference
parsing expression grammar
title Parsing Expression Grammars and Their Induction Algorithm
title_full Parsing Expression Grammars and Their Induction Algorithm
title_fullStr Parsing Expression Grammars and Their Induction Algorithm
title_full_unstemmed Parsing Expression Grammars and Their Induction Algorithm
title_short Parsing Expression Grammars and Their Induction Algorithm
title_sort parsing expression grammars and their induction algorithm
topic classification
genetic programming
grammatical inference
parsing expression grammar
url https://www.mdpi.com/2076-3417/10/23/8747
work_keys_str_mv AT wojciechwieczorek parsingexpressiongrammarsandtheirinductionalgorithm
AT olgierdunold parsingexpressiongrammarsandtheirinductionalgorithm
AT łukaszstrak parsingexpressiongrammarsandtheirinductionalgorithm