Efficient loss-less compression for genetic data

As the usage of technology increases rapidly today, the amount of data created also increases exponentially. In particular, the rate of increase in DNA sequencing has been rising. Efficient compression significantly reduces the storage and maintenance cost. Therefore, this project will look into ava...

Full description

Bibliographic Details
Main Author: Tye, Yong Meng
Other Authors: Anupam Chattopadhyay
Format: Final Year Project (FYP)
Language:English
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/10356/66707
_version_ 1826125704962506752
author Tye, Yong Meng
author2 Anupam Chattopadhyay
author_facet Anupam Chattopadhyay
Tye, Yong Meng
author_sort Tye, Yong Meng
collection NTU
description As the usage of technology increases rapidly today, the amount of data created also increases exponentially. In particular, the rate of increase in DNA sequencing has been rising. Efficient compression significantly reduces the storage and maintenance cost. Therefore, this project will look into available compression algorithms which work better than other general compression tools. The first algorithm that will be examined is logic synthesis. It is an algorithm which takes in binary string as input, process it into logic circuits and then giving an optimized logic circuit as the output. This algorithm will work on a segment of DNA sequences to determine if it works well with such data. The second algorithm comes from the Fqzcomp program which won the first prize in the sequence squeeze competition because it offered the best compression ratio on DNA sequences. It will be examined and suggestions will be proposed to make it more efficient.
first_indexed 2024-10-01T06:41:02Z
format Final Year Project (FYP)
id ntu-10356/66707
institution Nanyang Technological University
language English
last_indexed 2024-10-01T06:41:02Z
publishDate 2016
record_format dspace
spelling ntu-10356/667072023-03-03T20:27:50Z Efficient loss-less compression for genetic data Tye, Yong Meng Anupam Chattopadhyay School of Computer Engineering DRNTU::Engineering As the usage of technology increases rapidly today, the amount of data created also increases exponentially. In particular, the rate of increase in DNA sequencing has been rising. Efficient compression significantly reduces the storage and maintenance cost. Therefore, this project will look into available compression algorithms which work better than other general compression tools. The first algorithm that will be examined is logic synthesis. It is an algorithm which takes in binary string as input, process it into logic circuits and then giving an optimized logic circuit as the output. This algorithm will work on a segment of DNA sequences to determine if it works well with such data. The second algorithm comes from the Fqzcomp program which won the first prize in the sequence squeeze competition because it offered the best compression ratio on DNA sequences. It will be examined and suggestions will be proposed to make it more efficient. Bachelor of Engineering (Computer Science) 2016-04-21T07:58:42Z 2016-04-21T07:58:42Z 2016 Final Year Project (FYP) http://hdl.handle.net/10356/66707 en Nanyang Technological University 30 p. application/pdf
spellingShingle DRNTU::Engineering
Tye, Yong Meng
Efficient loss-less compression for genetic data
title Efficient loss-less compression for genetic data
title_full Efficient loss-less compression for genetic data
title_fullStr Efficient loss-less compression for genetic data
title_full_unstemmed Efficient loss-less compression for genetic data
title_short Efficient loss-less compression for genetic data
title_sort efficient loss less compression for genetic data
topic DRNTU::Engineering
url http://hdl.handle.net/10356/66707
work_keys_str_mv AT tyeyongmeng efficientlosslesscompressionforgeneticdata