A Novel Automatic Relational Database Normalization Method

The increase in data diversity and the fact that database design is a difficult process make it practically impossible to design a unique database schema for all datasets encountered. In this paper, we introduce a fully automatic genetic algorithm-based relational database normalization method for r...

Bibliographic Details
Main Authors: Emre Akadal, Mehmet Hakan Satman
Format: Article
Language: ces
Published: Prague University of Economics and Business 2022-12-01
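Series: Acta Informatica Pragensia
Subjects: relational databases; automatic normalization; genetic algorithms; optimization; decision support
Online Access: https://aip.vse.cz/artkey/aip-202203-0002_a-novel-automatic-relational-database-normalization-method.php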
author Emre Akadal
Mehmet Hakan Satman
author_facet Emre Akadal
Mehmet Hakan Satman
author_sort Emre Akadal
collection DOAJ
description The growing diversity of data and the difficulty of database design make it practically impossible to design a single database schema that suits every dataset encountered. In this paper, we introduce a fully automatic, genetic algorithm-based relational database normalization method that reveals the right database schema from a raw dataset without any prior knowledge. To measure the performance of the algorithm, we perform a simulation study using 250 datasets produced from 50 well-known databases. A total of 2500 simulations are carried out: ten runs for each of five denormalized variations of every database design, each containing different synthetic content. The results of the simulation study show that the proposed algorithm discovers 72% of the unknown database schemas exactly. The performance can be improved by fine-tuning the optimization parameters. The results also show that the devised algorithm can be applied to many datasets to reveal the structure of a database when only a raw dataset is available.
first_indexed 2024-04-09T22:02:36Z
format Article
id doaj.art-d91b15e8f8074f8f90eb532c1545981d
institution Directory Open Access Journal
issn 1805-4951
language ces
last_indexed 2024-04-09T22:02:36Z
publishDate 2022-12-01
publisher Prague University of Economics and Business
record_format Article
series Acta Informatica Pragensia
spelling doaj.art-d91b15e8f8074f8f90eb532c1545981d; 2023-03-23T14:16:15Z; ces; Prague University of Economics and Business; Acta Informatica Pragensia; ISSN 1805-4951; 2022-12-01; vol. 11, no. 3, pp. 293-308; DOI 10.18267/j.aip.193; aip-202203-0002; A Novel Automatic Relational Database Normalization Method; Emre Akadal (Department of Management Information Systems, Faculty of Economics, Istanbul University, Beyazit Kampüsü, 34116 Fatih/İstanbul, Turkey); Mehmet Hakan Satman (Department of Econometrics, Faculty of Economics, Istanbul University, Beyazit Kampüsü, 34116 Fatih/İstanbul, Turkey); https://aip.vse.cz/artkey/aip-202203-0002_a-novel-automatic-relational-database-normalization-method.php; relational databases; automatic normalization; genetic algorithms; optimization; decision support
title A Novel Automatic Relational Database Normalization Method
topic relational databases
automatic normalization
genetic algorithms
optimization
decision support
url https://aip.vse.cz/artkey/aip-202203-0002_a-novel-automatic-relational-database-normalization-method.php