Efficient loss-less compression for genetic data
As the usage of technology increases rapidly today, the amount of data created also increases exponentially. In particular, the rate of increase in DNA sequencing has been rising. Efficient compression significantly reduces the storage and maintenance cost. Therefore, this project will look into ava...
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project (FYP) |
Language: | English |
Published: |
2016
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/66707 |
_version_ | 1826125704962506752 |
---|---|
author | Tye, Yong Meng |
author2 | Anupam Chattopadhyay |
author_facet | Anupam Chattopadhyay Tye, Yong Meng |
author_sort | Tye, Yong Meng |
collection | NTU |
description | As the usage of technology increases rapidly today, the amount of data created also increases exponentially. In particular, the rate of increase in DNA sequencing has been rising. Efficient compression significantly reduces the storage and maintenance cost. Therefore, this project will look into available compression algorithms which work better than other general compression tools. The first algorithm that will be examined is logic synthesis. It is an algorithm which takes in binary string as input, process it into logic circuits and then giving an optimized logic circuit as the output. This algorithm will work on a segment of DNA sequences to determine if it works well with such data. The second algorithm comes from the Fqzcomp program which won the first prize in the sequence squeeze competition because it offered the best compression ratio on DNA sequences. It will be examined and suggestions will be proposed to make it more efficient. |
first_indexed | 2024-10-01T06:41:02Z |
format | Final Year Project (FYP) |
id | ntu-10356/66707 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T06:41:02Z |
publishDate | 2016 |
record_format | dspace |
spelling | ntu-10356/667072023-03-03T20:27:50Z Efficient loss-less compression for genetic data Tye, Yong Meng Anupam Chattopadhyay School of Computer Engineering DRNTU::Engineering As the usage of technology increases rapidly today, the amount of data created also increases exponentially. In particular, the rate of increase in DNA sequencing has been rising. Efficient compression significantly reduces the storage and maintenance cost. Therefore, this project will look into available compression algorithms which work better than other general compression tools. The first algorithm that will be examined is logic synthesis. It is an algorithm which takes in binary string as input, process it into logic circuits and then giving an optimized logic circuit as the output. This algorithm will work on a segment of DNA sequences to determine if it works well with such data. The second algorithm comes from the Fqzcomp program which won the first prize in the sequence squeeze competition because it offered the best compression ratio on DNA sequences. It will be examined and suggestions will be proposed to make it more efficient. Bachelor of Engineering (Computer Science) 2016-04-21T07:58:42Z 2016-04-21T07:58:42Z 2016 Final Year Project (FYP) http://hdl.handle.net/10356/66707 en Nanyang Technological University 30 p. application/pdf |
spellingShingle | DRNTU::Engineering Tye, Yong Meng Efficient loss-less compression for genetic data |
title | Efficient loss-less compression for genetic data |
title_full | Efficient loss-less compression for genetic data |
title_fullStr | Efficient loss-less compression for genetic data |
title_full_unstemmed | Efficient loss-less compression for genetic data |
title_short | Efficient loss-less compression for genetic data |
title_sort | efficient loss less compression for genetic data |
topic | DRNTU::Engineering |
url | http://hdl.handle.net/10356/66707 |
work_keys_str_mv | AT tyeyongmeng efficientlosslesscompressionforgeneticdata |