DNA Sequences Compression by GP² R and Selective Encryption Using Modified RSA Technique

Humans, by nature, have always been fascinated by the possibility of being able to acquire more information in minimum possible time and space. The effective lossless compression method, effective data structure, and DNA (Deoxyribonucleic Acid) data searching are quite essential as they provide a st...

Full description

Bibliographic Details
Main Authors: Syed Mahamud Hossein, Debashis De, Pradeep Kumar Das Mohapatra, Sankar Prasad Mondal, Ali Ahmadian, Ferial Ghaemi, Norazak Senu
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9057492/
_version_ 1819150132877197312
author Syed Mahamud Hossein
Debashis De
Pradeep Kumar Das Mohapatra
Sankar Prasad Mondal
Ali Ahmadian
Ferial Ghaemi
Norazak Senu
author_facet Syed Mahamud Hossein
Debashis De
Pradeep Kumar Das Mohapatra
Sankar Prasad Mondal
Ali Ahmadian
Ferial Ghaemi
Norazak Senu
author_sort Syed Mahamud Hossein
collection DOAJ
description Humans, by nature, have always been fascinated by the possibility of being able to acquire more information in minimum possible time and space. The effective lossless compression method, effective data structure, and DNA (Deoxyribonucleic Acid) data searching are quite essential as they provide a stimulus to easy accessibility and communication. The proposed algorithm is a new Lossless Compression algorithm, which compresses data, based on two tiers. Firstly, it searches for the exact Genetic Palindrome(GP), Palindrome(P) and Reverse(R)[GP<sup>2</sup>R] and the substring is reported, which is replaced by the corresponding ASCII character creating a Library file. By using the ASCII code, the Library file acts as a signature as well as provides the security of data. Secondly, modified RSA technique is proposed for the selection encryption purpose. This selection encryption of the modified RSA technique is an approach to lessen computational resources for greatly sized DNA facts. The experimental work shows 44% to 45% original sequence is encrypted where above 95% of the original file is damaged by using this method. This technique can find out the 3.851273 bits per base of the compression rate. The O(n) is the complexity of this algorithm. The running time is a few seconds of this algorithm. This is a hybrid approach to the compression &amp; encryption process. For reducing the compression rate, the first pass output is again compressed by the second pass but it is lossy, This experiment is performed on benchmark DNA order.
first_indexed 2024-12-22T14:12:39Z
format Article
id doaj.art-a22e23fc9d234e45adb2951f641493f5
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-22T14:12:39Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-a22e23fc9d234e45adb2951f641493f52022-12-21T18:23:10ZengIEEEIEEE Access2169-35362020-01-018768807689510.1109/ACCESS.2020.29857339057492DNA Sequences Compression by GP&#x00B2; R and Selective Encryption Using Modified RSA TechniqueSyed Mahamud Hossein0Debashis De1https://orcid.org/0000-0002-9688-9806Pradeep Kumar Das Mohapatra2Sankar Prasad Mondal3https://orcid.org/0000-0003-4690-2598Ali Ahmadian4https://orcid.org/0000-0002-0106-7050Ferial Ghaemi5https://orcid.org/0000-0003-2987-218XNorazak Senu6Department of Computer Science, Vidyasagar University, Midnapore, IndiaDepartment of Computer Science and Engineering, Maulana Abul Kalam Azad University of Technology, Nadia, IndiaDepartment of Microbiology, Raiganj University, Raiganj, IndiaDepartment of Applied Science, Maulana Abul Kalam Azad University of Technology, Nadia, IndiaInstitute of Industry Revolution 4.0, National University of Malaysia, Selangor, MalaysiaFaculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia (UKM), Selangor, MalaysiaInstitute for Mathematical Research (INSPEM), Universiti Putra Malaysia (UPM), Selangor, MalaysiaHumans, by nature, have always been fascinated by the possibility of being able to acquire more information in minimum possible time and space. The effective lossless compression method, effective data structure, and DNA (Deoxyribonucleic Acid) data searching are quite essential as they provide a stimulus to easy accessibility and communication. The proposed algorithm is a new Lossless Compression algorithm, which compresses data, based on two tiers. Firstly, it searches for the exact Genetic Palindrome(GP), Palindrome(P) and Reverse(R)[GP<sup>2</sup>R] and the substring is reported, which is replaced by the corresponding ASCII character creating a Library file. By using the ASCII code, the Library file acts as a signature as well as provides the security of data. Secondly, modified RSA technique is proposed for the selection encryption purpose. This selection encryption of the modified RSA technique is an approach to lessen computational resources for greatly sized DNA facts. The experimental work shows 44% to 45% original sequence is encrypted where above 95% of the original file is damaged by using this method. This technique can find out the 3.851273 bits per base of the compression rate. The O(n) is the complexity of this algorithm. The running time is a few seconds of this algorithm. This is a hybrid approach to the compression &amp; encryption process. For reducing the compression rate, the first pass output is again compressed by the second pass but it is lossy, This experiment is performed on benchmark DNA order.https://ieeexplore.ieee.org/document/9057492/Reversegenetic palindromepalindromecompressionratiorate
spellingShingle Syed Mahamud Hossein
Debashis De
Pradeep Kumar Das Mohapatra
Sankar Prasad Mondal
Ali Ahmadian
Ferial Ghaemi
Norazak Senu
DNA Sequences Compression by GP&#x00B2; R and Selective Encryption Using Modified RSA Technique
IEEE Access
Reverse
genetic palindrome
palindrome
compression
ratio
rate
title DNA Sequences Compression by GP&#x00B2; R and Selective Encryption Using Modified RSA Technique
title_full DNA Sequences Compression by GP&#x00B2; R and Selective Encryption Using Modified RSA Technique
title_fullStr DNA Sequences Compression by GP&#x00B2; R and Selective Encryption Using Modified RSA Technique
title_full_unstemmed DNA Sequences Compression by GP&#x00B2; R and Selective Encryption Using Modified RSA Technique
title_short DNA Sequences Compression by GP&#x00B2; R and Selective Encryption Using Modified RSA Technique
title_sort dna sequences compression by gp x00b2 r and selective encryption using modified rsa technique
topic Reverse
genetic palindrome
palindrome
compression
ratio
rate
url https://ieeexplore.ieee.org/document/9057492/
work_keys_str_mv AT syedmahamudhossein dnasequencescompressionbygpx00b2randselectiveencryptionusingmodifiedrsatechnique
AT debashisde dnasequencescompressionbygpx00b2randselectiveencryptionusingmodifiedrsatechnique
AT pradeepkumardasmohapatra dnasequencescompressionbygpx00b2randselectiveencryptionusingmodifiedrsatechnique
AT sankarprasadmondal dnasequencescompressionbygpx00b2randselectiveencryptionusingmodifiedrsatechnique
AT aliahmadian dnasequencescompressionbygpx00b2randselectiveencryptionusingmodifiedrsatechnique
AT ferialghaemi dnasequencescompressionbygpx00b2randselectiveencryptionusingmodifiedrsatechnique
AT norazaksenu dnasequencescompressionbygpx00b2randselectiveencryptionusingmodifiedrsatechnique