DSP: Schema Design for Non-Relational Applications
The way a database schema is designed has a high impact on its performance in relational databases, which are symmetric in nature. While the problem of schema optimization is even more significant for NoSQL (“Not only SQL”) databases, existing modeling tools for relational databases are inadequate f...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-10-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-8994/12/11/1799 |
_version_ | 1797549328800153600 |
---|---|
author | Abdullahi Abubakar Imam Shuib Basri Rohiza Ahmad Amirudin A. Wahab María T. González-Aparicio Luiz Fernando Capretz Ammar K. Alazzawi Abdullateef O. Balogun |
author_facet | Abdullahi Abubakar Imam Shuib Basri Rohiza Ahmad Amirudin A. Wahab María T. González-Aparicio Luiz Fernando Capretz Ammar K. Alazzawi Abdullateef O. Balogun |
author_sort | Abdullahi Abubakar Imam |
collection | DOAJ |
description | The way a database schema is designed has a high impact on its performance in relational databases, which are symmetric in nature. While the problem of schema optimization is even more significant for NoSQL (“Not only SQL”) databases, existing modeling tools for relational databases are inadequate for this asymmetric setting. As a result, NoSQL modelers rely on rules of thumb to model schemas that require a high level of competence. Several studies have been conducted to address this problem; however, they are either proprietary, symmetrical, relationally dependent or post-design assessment tools. In this study, a Dynamic Schema Proposition (DSP) model for NoSQL databases is proposed to handle the asymmetric nature of today’s data. This model aims to facilitate database design and improve its performance in relation to data availability. To achieve this, data modeling styles were aggregated and classified. Existing cardinality notations were empirically extended using synthetically generated queries. A binary integer formulation was used to guide the mapping of asymmetric entities from the application’s conceptual data model to a database schema. An experiment was conducted to evaluate the impact of the DSP model on NoSQL schema production and its performance. A profound improvement was observed in read/write query performance and schema production complexities. In this regard, DSP has significant potential to produce schemas that are capable of handling big data efficiently. |
first_indexed | 2024-03-10T15:13:08Z |
format | Article |
id | doaj.art-9875518de9be46afb949014c299ce19a |
institution | Directory Open Access Journal |
issn | 2073-8994 |
language | English |
last_indexed | 2024-03-10T15:13:08Z |
publishDate | 2020-10-01 |
publisher | MDPI AG |
record_format | Article |
series | Symmetry |
spelling | doaj.art-9875518de9be46afb949014c299ce19a2023-11-20T19:10:43ZengMDPI AGSymmetry2073-89942020-10-011211179910.3390/sym12111799DSP: Schema Design for Non-Relational ApplicationsAbdullahi Abubakar Imam0Shuib Basri1Rohiza Ahmad2Amirudin A. Wahab3María T. González-Aparicio4Luiz Fernando Capretz5Ammar K. Alazzawi6Abdullateef O. Balogun7Department of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaCybersecurity HQ Malaysia, Menara Cyber Axis, Jalan Impact, Cyberjaya 63000, MalaysiaComputing Department, University of Oviedo, 3, 33003 Gijon, SpainDepartment of Electrical and Computer Engineering, Western University, London, ON N6A5B9, CanadaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaThe way a database schema is designed has a high impact on its performance in relational databases, which are symmetric in nature. While the problem of schema optimization is even more significant for NoSQL (“Not only SQL”) databases, existing modeling tools for relational databases are inadequate for this asymmetric setting. As a result, NoSQL modelers rely on rules of thumb to model schemas that require a high level of competence. Several studies have been conducted to address this problem; however, they are either proprietary, symmetrical, relationally dependent or post-design assessment tools. In this study, a Dynamic Schema Proposition (DSP) model for NoSQL databases is proposed to handle the asymmetric nature of today’s data. This model aims to facilitate database design and improve its performance in relation to data availability. To achieve this, data modeling styles were aggregated and classified. Existing cardinality notations were empirically extended using synthetically generated queries. A binary integer formulation was used to guide the mapping of asymmetric entities from the application’s conceptual data model to a database schema. An experiment was conducted to evaluate the impact of the DSP model on NoSQL schema production and its performance. A profound improvement was observed in read/write query performance and schema production complexities. In this regard, DSP has significant potential to produce schemas that are capable of handling big data efficiently.https://www.mdpi.com/2073-8994/12/11/1799big data applicationsNoSQL databasesquery processingschema optimization |
spellingShingle | Abdullahi Abubakar Imam Shuib Basri Rohiza Ahmad Amirudin A. Wahab María T. González-Aparicio Luiz Fernando Capretz Ammar K. Alazzawi Abdullateef O. Balogun DSP: Schema Design for Non-Relational Applications Symmetry big data applications NoSQL databases query processing schema optimization |
title | DSP: Schema Design for Non-Relational Applications |
title_full | DSP: Schema Design for Non-Relational Applications |
title_fullStr | DSP: Schema Design for Non-Relational Applications |
title_full_unstemmed | DSP: Schema Design for Non-Relational Applications |
title_short | DSP: Schema Design for Non-Relational Applications |
title_sort | dsp schema design for non relational applications |
topic | big data applications NoSQL databases query processing schema optimization |
url | https://www.mdpi.com/2073-8994/12/11/1799 |
work_keys_str_mv | AT abdullahiabubakarimam dspschemadesignfornonrelationalapplications AT shuibbasri dspschemadesignfornonrelationalapplications AT rohizaahmad dspschemadesignfornonrelationalapplications AT amirudinawahab dspschemadesignfornonrelationalapplications AT mariatgonzalezaparicio dspschemadesignfornonrelationalapplications AT luizfernandocapretz dspschemadesignfornonrelationalapplications AT ammarkalazzawi dspschemadesignfornonrelationalapplications AT abdullateefobalogun dspschemadesignfornonrelationalapplications |