A Novel Automatic Relational Database Normalization Method
The increase in data diversity and the fact that database design is a difficult process make it practically impossible to design a unique database schema for all datasets encountered. In this paper, we introduce a fully automatic genetic algorithm-based relational database normalization method for r...
Main Authors: | , |
---|---|
Format: | Article |
Language: | ces |
Published: |
Prague University of Economics and Business
2022-12-01
|
Series: | Acta Informatica Pragensia |
Subjects: | |
Online Access: | https://aip.vse.cz/artkey/aip-202203-0002_a-novel-automatic-relational-database-normalization-method.php |
_version_ | 1797861447482474496 |
---|---|
author | Emre Akadal Mehmet Hakan Satman |
author_facet | Emre Akadal Mehmet Hakan Satman |
author_sort | Emre Akadal |
collection | DOAJ |
description | The increase in data diversity and the fact that database design is a difficult process make it practically impossible to design a unique database schema for all datasets encountered. In this paper, we introduce a fully automatic genetic algorithm-based relational database normalization method for revealing the right database schema using a raw dataset and without the need for any prior knowledge. For measuring the performance of the algorithm, we perform a simulation study using 250 datasets produced using 50 well-known databases. A total of 2500 simulations are carried out, ten times for each of five denormalized variations of all database designs containing different synthetic contents. The results of the simulation study show that the proposed algorithm discovers exactly 72% of the unknown database schemas. The performance can be improved by fine-tuning the optimization parameters. The results of the simulation study also show that the devised algorithm can be used in many datasets to reveal structs of databases when only a raw dataset is available at hand. |
first_indexed | 2024-04-09T22:02:36Z |
format | Article |
id | doaj.art-d91b15e8f8074f8f90eb532c1545981d |
institution | Directory Open Access Journal |
issn | 1805-4951 |
language | ces |
last_indexed | 2024-04-09T22:02:36Z |
publishDate | 2022-12-01 |
publisher | Prague University of Economics and Business |
record_format | Article |
series | Acta Informatica Pragensia |
spelling | doaj.art-d91b15e8f8074f8f90eb532c1545981d2023-03-23T14:16:15ZcesPrague University of Economics and BusinessActa Informatica Pragensia1805-49512022-12-0111329330810.18267/j.aip.193aip-202203-0002A Novel Automatic Relational Database Normalization MethodEmre Akadal0Mehmet Hakan Satman1Department of Management Information Systems, Faculty of Economics, Istanbul University, Beyazit Kampüsü, 34116 Fatih/İstanbul, TurkeyDepartment of Econometrics, Faculty of Economics, Istanbul University, Beyazit Kampüsü, 34116 Fatih/İstanbul, TurkeyThe increase in data diversity and the fact that database design is a difficult process make it practically impossible to design a unique database schema for all datasets encountered. In this paper, we introduce a fully automatic genetic algorithm-based relational database normalization method for revealing the right database schema using a raw dataset and without the need for any prior knowledge. For measuring the performance of the algorithm, we perform a simulation study using 250 datasets produced using 50 well-known databases. A total of 2500 simulations are carried out, ten times for each of five denormalized variations of all database designs containing different synthetic contents. The results of the simulation study show that the proposed algorithm discovers exactly 72% of the unknown database schemas. The performance can be improved by fine-tuning the optimization parameters. The results of the simulation study also show that the devised algorithm can be used in many datasets to reveal structs of databases when only a raw dataset is available at hand.https://aip.vse.cz/artkey/aip-202203-0002_a-novel-automatic-relational-database-normalization-method.phprelational databasesautomatic normalizationgenetic algorithmsoptimizationdecision support |
spellingShingle | Emre Akadal Mehmet Hakan Satman A Novel Automatic Relational Database Normalization Method Acta Informatica Pragensia relational databases automatic normalization genetic algorithms optimization decision support |
title | A Novel Automatic Relational Database Normalization Method |
title_full | A Novel Automatic Relational Database Normalization Method |
title_fullStr | A Novel Automatic Relational Database Normalization Method |
title_full_unstemmed | A Novel Automatic Relational Database Normalization Method |
title_short | A Novel Automatic Relational Database Normalization Method |
title_sort | novel automatic relational database normalization method |
topic | relational databases automatic normalization genetic algorithms optimization decision support |
url | https://aip.vse.cz/artkey/aip-202203-0002_a-novel-automatic-relational-database-normalization-method.php |
work_keys_str_mv | AT emreakadal anovelautomaticrelationaldatabasenormalizationmethod AT mehmethakansatman anovelautomaticrelationaldatabasenormalizationmethod AT emreakadal novelautomaticrelationaldatabasenormalizationmethod AT mehmethakansatman novelautomaticrelationaldatabasenormalizationmethod |