Algorithms for genomics and genetics : compression-accelerated search and admixture analysis

Thesis (Ph. D.)--Massachusetts Institute of Technology, Department of Mathematics, 2013.

Bibliographic Details
Main Author: Loh, Po-Ru
Other Authors: Bonnie Berger.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2014
Subjects:
Online Access:http://hdl.handle.net/1721.1/83631
_version_ 1826210530577088512
author Loh, Po-Ru
author2 Bonnie Berger.
author_facet Bonnie Berger.
Loh, Po-Ru
author_sort Loh, Po-Ru
collection MIT
description Thesis (Ph. D.)--Massachusetts Institute of Technology, Department of Mathematics, 2013.
first_indexed 2024-09-23T14:51:26Z
format Thesis
id mit-1721.1/83631
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T14:51:26Z
publishDate 2014
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/836312019-04-11T11:32:49Z Algorithms for genomics and genetics : compression-accelerated search and admixture analysis Loh, Po-Ru Bonnie Berger. Massachusetts Institute of Technology. Department of Mathematics. Massachusetts Institute of Technology. Department of Mathematics. Mathematics. Thesis (Ph. D.)--Massachusetts Institute of Technology, Department of Mathematics, 2013. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 133-139). Rapid advances in next-generation sequencing technologies are revolutionizing genomics, with data sets at the scale of thousands of human genomes fast becoming the norm. These technological leaps promise to enable corresponding advances in biology and medicine, but the deluge of raw data poses substantial mathematical, computational and statistical challenges that must first be overcome. This thesis consists of two research thrusts along these lines. First, we propose an algorithmic framework, "compressive genomics," that accelerates bioinformatic computations through analysis-aware compression. We demonstrate this methodology with proof-of-concept implementations of compression-accelerated search (CaBLAST and CaBLAT). Second, we develop new computational tools for investigating population admixture, a phenomenon of importance in understanding demographic histories of human populations and facilitating association mapping of disease genes. Our recently released ALDER and MixMapper software packages provide fast, sensitive, and robust methods for detecting and analyzing signatures of admixture created by genetic drift and recombination on genome-wide, large-sample scales. by Po-Ru Loh. Ph.D. 2014-01-09T18:54:20Z 2014-01-09T18:54:20Z 2013 Thesis http://hdl.handle.net/1721.1/83631 864152524 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 139 pages application/pdf Massachusetts Institute of Technology
spellingShingle Mathematics.
Loh, Po-Ru
Algorithms for genomics and genetics : compression-accelerated search and admixture analysis
title Algorithms for genomics and genetics : compression-accelerated search and admixture analysis
title_full Algorithms for genomics and genetics : compression-accelerated search and admixture analysis
title_fullStr Algorithms for genomics and genetics : compression-accelerated search and admixture analysis
title_full_unstemmed Algorithms for genomics and genetics : compression-accelerated search and admixture analysis
title_short Algorithms for genomics and genetics : compression-accelerated search and admixture analysis
title_sort algorithms for genomics and genetics compression accelerated search and admixture analysis
topic Mathematics.
url http://hdl.handle.net/1721.1/83631
work_keys_str_mv AT lohporu algorithmsforgenomicsandgeneticscompressionacceleratedsearchandadmixtureanalysis