Comparative gene identification in mammalian, fly, and fungal genomes
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2007
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/36807 |
_version_ | 1811097631929139200 |
---|---|
author | Lin, Michael F. (Michael Fong-Jay) |
author2 | Manolis Kellis. |
author_facet | Manolis Kellis. Lin, Michael F. (Michael Fong-Jay) |
author_sort | Lin, Michael F. (Michael Fong-Jay) |
collection | MIT |
description | Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. |
first_indexed | 2024-09-23T17:02:33Z |
format | Thesis |
id | mit-1721.1/36807 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T17:02:33Z |
publishDate | 2007 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/368072019-04-12T16:06:29Z Comparative gene identification in mammalian, fly, and fungal genomes Lin, Michael F. (Michael Fong-Jay) Manolis Kellis. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. Includes bibliographical references (leaves 55-56). An important step in genome interpretation is the accurate identification of protein-coding genes. One approach to gene identification is comparative analysis of the genomes of several related species, to find genes that have been conserved by natural selection over millions of years of evolution. I develop general computational methods that combine statistical analysis of genome sequence alignments with classification algorithms in order to detect the distinctive signatures of protein-coding DNA sequence evolution. I implement these methods as a software system, which I then use to identify previously unknown genes, and cast doubt on some existing gene annotations, in the genomes of the fungi Saccharomyces cerevisiae and Candida albicans, the fruit fly Drosophila melanogaster, and the human. These methods perform competitively with the best existing de novo gene identification systems, and are practically applicable to the goal of improving existing gene annotations through comparative genomics. by Michael F. Lin. M.Eng. 2007-03-12T17:55:35Z 2007-03-12T17:55:35Z 2006 2006 Thesis http://hdl.handle.net/1721.1/36807 80777803 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 56 leaves application/pdf Massachusetts Institute of Technology |
spellingShingle | Electrical Engineering and Computer Science. Lin, Michael F. (Michael Fong-Jay) Comparative gene identification in mammalian, fly, and fungal genomes |
title | Comparative gene identification in mammalian, fly, and fungal genomes |
title_full | Comparative gene identification in mammalian, fly, and fungal genomes |
title_fullStr | Comparative gene identification in mammalian, fly, and fungal genomes |
title_full_unstemmed | Comparative gene identification in mammalian, fly, and fungal genomes |
title_short | Comparative gene identification in mammalian, fly, and fungal genomes |
title_sort | comparative gene identification in mammalian fly and fungal genomes |
topic | Electrical Engineering and Computer Science. |
url | http://hdl.handle.net/1721.1/36807 |
work_keys_str_mv | AT linmichaelfmichaelfongjay comparativegeneidentificationinmammalianflyandfungalgenomes |