DSP: Schema Design for Non-Relational Applications

The way a database schema is designed has a high impact on its performance in relational databases, which are symmetric in nature. While the problem of schema optimization is even more significant for NoSQL (“Not only SQL”) databases, existing modeling tools for relational databases are inadequate f...

Full description

Bibliographic Details
Main Authors: Abdullahi Abubakar Imam, Shuib Basri, Rohiza Ahmad, Amirudin A. Wahab, María T. González-Aparicio, Luiz Fernando Capretz, Ammar K. Alazzawi, Abdullateef O. Balogun
Format: Article
Language:English
Published: MDPI AG 2020-10-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/11/1799
_version_ 1797549328800153600
author Abdullahi Abubakar Imam
Shuib Basri
Rohiza Ahmad
Amirudin A. Wahab
María T. González-Aparicio
Luiz Fernando Capretz
Ammar K. Alazzawi
Abdullateef O. Balogun
author_facet Abdullahi Abubakar Imam
Shuib Basri
Rohiza Ahmad
Amirudin A. Wahab
María T. González-Aparicio
Luiz Fernando Capretz
Ammar K. Alazzawi
Abdullateef O. Balogun
author_sort Abdullahi Abubakar Imam
collection DOAJ
description The way a database schema is designed has a high impact on its performance in relational databases, which are symmetric in nature. While the problem of schema optimization is even more significant for NoSQL (“Not only SQL”) databases, existing modeling tools for relational databases are inadequate for this asymmetric setting. As a result, NoSQL modelers rely on rules of thumb to model schemas that require a high level of competence. Several studies have been conducted to address this problem; however, they are either proprietary, symmetrical, relationally dependent or post-design assessment tools. In this study, a Dynamic Schema Proposition (DSP) model for NoSQL databases is proposed to handle the asymmetric nature of today’s data. This model aims to facilitate database design and improve its performance in relation to data availability. To achieve this, data modeling styles were aggregated and classified. Existing cardinality notations were empirically extended using synthetically generated queries. A binary integer formulation was used to guide the mapping of asymmetric entities from the application’s conceptual data model to a database schema. An experiment was conducted to evaluate the impact of the DSP model on NoSQL schema production and its performance. A profound improvement was observed in read/write query performance and schema production complexities. In this regard, DSP has significant potential to produce schemas that are capable of handling big data efficiently.
first_indexed 2024-03-10T15:13:08Z
format Article
id doaj.art-9875518de9be46afb949014c299ce19a
institution Directory Open Access Journal
issn 2073-8994
language English
last_indexed 2024-03-10T15:13:08Z
publishDate 2020-10-01
publisher MDPI AG
record_format Article
series Symmetry
spelling doaj.art-9875518de9be46afb949014c299ce19a2023-11-20T19:10:43ZengMDPI AGSymmetry2073-89942020-10-011211179910.3390/sym12111799DSP: Schema Design for Non-Relational ApplicationsAbdullahi Abubakar Imam0Shuib Basri1Rohiza Ahmad2Amirudin A. Wahab3María T. González-Aparicio4Luiz Fernando Capretz5Ammar K. Alazzawi6Abdullateef O. Balogun7Department of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaCybersecurity HQ Malaysia, Menara Cyber Axis, Jalan Impact, Cyberjaya 63000, MalaysiaComputing Department, University of Oviedo, 3, 33003 Gijon, SpainDepartment of Electrical and Computer Engineering, Western University, London, ON N6A5B9, CanadaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaDepartment of Computer and Information Science, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, MalaysiaThe way a database schema is designed has a high impact on its performance in relational databases, which are symmetric in nature. While the problem of schema optimization is even more significant for NoSQL (“Not only SQL”) databases, existing modeling tools for relational databases are inadequate for this asymmetric setting. As a result, NoSQL modelers rely on rules of thumb to model schemas that require a high level of competence. Several studies have been conducted to address this problem; however, they are either proprietary, symmetrical, relationally dependent or post-design assessment tools. In this study, a Dynamic Schema Proposition (DSP) model for NoSQL databases is proposed to handle the asymmetric nature of today’s data. This model aims to facilitate database design and improve its performance in relation to data availability. To achieve this, data modeling styles were aggregated and classified. Existing cardinality notations were empirically extended using synthetically generated queries. A binary integer formulation was used to guide the mapping of asymmetric entities from the application’s conceptual data model to a database schema. An experiment was conducted to evaluate the impact of the DSP model on NoSQL schema production and its performance. A profound improvement was observed in read/write query performance and schema production complexities. In this regard, DSP has significant potential to produce schemas that are capable of handling big data efficiently.https://www.mdpi.com/2073-8994/12/11/1799big data applicationsNoSQL databasesquery processingschema optimization
spellingShingle Abdullahi Abubakar Imam
Shuib Basri
Rohiza Ahmad
Amirudin A. Wahab
María T. González-Aparicio
Luiz Fernando Capretz
Ammar K. Alazzawi
Abdullateef O. Balogun
DSP: Schema Design for Non-Relational Applications
Symmetry
big data applications
NoSQL databases
query processing
schema optimization
title DSP: Schema Design for Non-Relational Applications
title_full DSP: Schema Design for Non-Relational Applications
title_fullStr DSP: Schema Design for Non-Relational Applications
title_full_unstemmed DSP: Schema Design for Non-Relational Applications
title_short DSP: Schema Design for Non-Relational Applications
title_sort dsp schema design for non relational applications
topic big data applications
NoSQL databases
query processing
schema optimization
url https://www.mdpi.com/2073-8994/12/11/1799
work_keys_str_mv AT abdullahiabubakarimam dspschemadesignfornonrelationalapplications
AT shuibbasri dspschemadesignfornonrelationalapplications
AT rohizaahmad dspschemadesignfornonrelationalapplications
AT amirudinawahab dspschemadesignfornonrelationalapplications
AT mariatgonzalezaparicio dspschemadesignfornonrelationalapplications
AT luizfernandocapretz dspschemadesignfornonrelationalapplications
AT ammarkalazzawi dspschemadesignfornonrelationalapplications
AT abdullateefobalogun dspschemadesignfornonrelationalapplications