Methods and analysis of genome-scale gene family evolution across multiple species
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010.
Main Author: | Rasmussen, Matthew D. (Matthew David) |
---|---|
Other Authors: | Manolis Kellis |
Format: | Thesis |
Language: | eng |
Published: | Massachusetts Institute of Technology, 2011 |
Subjects: | Electrical Engineering and Computer Science |
Online Access: | http://hdl.handle.net/1721.1/62433 |
_version_ | 1811074089030254592 |
---|---|
author | Rasmussen, Matthew D. (Matthew David) |
author2 | Manolis Kellis. |
author_facet | Manolis Kellis. Rasmussen, Matthew D. (Matthew David) |
author_sort | Rasmussen, Matthew D. (Matthew David) |
collection | MIT |
description | Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010. |
first_indexed | 2024-09-23T09:42:58Z |
format | Thesis |
id | mit-1721.1/62433 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T09:42:58Z |
publishDate | 2011 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/62433 2019-04-12T09:02:26Z Methods and analysis of genome-scale gene family evolution across multiple species Rasmussen, Matthew D. (Matthew David) Manolis Kellis. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010. Cataloged from PDF version of thesis. Includes bibliographical references (p. 123-136). The fields of genomics and evolution have continually benefited from one another in their common goal of understanding the biological world. This partnership has been accelerated by ever increasing sequencing and high-throughput technologies. Although the future of genomic and evolutionary studies is bright, new models and methods will be needed to address the growing and changing challenges of large-scale datasets. In this work, I explore how evolution generates the diversity of life we see in modern species, specifically the evolution of new genes and functions. By reconstructing the history of the diverse sequences present in modern species, we can improve our understanding of their function and evolutionary importance. Performing such an analysis requires a principled and efficient means of computing the most probable evolutionary scenarios. To address these challenges, I introduce a new model of gene family evolution as well as a new method, SPIMAP, an efficient Bayesian method for reconstructing gene trees in the presence of a known species tree. We observe many improvements in reconstruction accuracy, achieved by modeling multiple aspects of evolution, including gene duplication and loss rates, speciation times, and correlated substitution rate variation across both species and loci. 
I have implemented and applied this method to two clades of fully-sequenced species, 12 Drosophila and 16 fungal genomes, as well as simulated phylogenies, and find dramatic improvements in reconstruction accuracy as compared to the most popular existing methods, including those that take the species tree into account. Lastly, I use the SPIMAP method to reconstruct the evolutionary history of all gene families in 16 fungal species including several relatives of the pathogenic species C. albicans. From these reconstructions, we identify several families enriched with duplications and positive selection in pathogenic lineages. These reconstructions shed light on the evolution of these species and provide a better understanding of the genes involved in pathogenicity. by Matthew D. Rasmussen. Ph.D. 2011-04-25T15:58:00Z 2011-04-25T15:58:00Z 2010 2010 Thesis http://hdl.handle.net/1721.1/62433 710994361 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 136 p. application/pdf Massachusetts Institute of Technology |
spellingShingle | Electrical Engineering and Computer Science. Rasmussen, Matthew D. (Matthew David) Methods and analysis of genome-scale gene family evolution across multiple species |
title | Methods and analysis of genome-scale gene family evolution across multiple species |
title_sort | methods and analysis of genome scale gene family evolution across multiple species |
topic | Electrical Engineering and Computer Science. |
url | http://hdl.handle.net/1721.1/62433 |