Comparative gene identification in mammalian, fly, and fungal genomes

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.

Bibliographic Details
Main Author: Lin, Michael F. (Michael Fong-Jay)
Other Authors: Manolis Kellis.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2007
Subjects:
Online Access:http://hdl.handle.net/1721.1/36807
_version_ 1811097631929139200
author Lin, Michael F. (Michael Fong-Jay)
author2 Manolis Kellis.
author_facet Manolis Kellis.
Lin, Michael F. (Michael Fong-Jay)
author_sort Lin, Michael F. (Michael Fong-Jay)
collection MIT
description Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.
first_indexed 2024-09-23T17:02:33Z
format Thesis
id mit-1721.1/36807
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T17:02:33Z
publishDate 2007
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/368072019-04-12T16:06:29Z Comparative gene identification in mammalian, fly, and fungal genomes Lin, Michael F. (Michael Fong-Jay) Manolis Kellis. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. Includes bibliographical references (leaves 55-56). An important step in genome interpretation is the accurate identification of protein-coding genes. One approach to gene identification is comparative analysis of the genomes of several related species, to find genes that have been conserved by natural selection over millions of years of evolution. I develop general computational methods that combine statistical analysis of genome sequence alignments with classification algorithms in order to detect the distinctive signatures of protein-coding DNA sequence evolution. I implement these methods as a software system, which I then use to identify previously unknown genes, and cast doubt on some existing gene annotations, in the genomes of the fungi Saccharomyces cerevisiae and Candida albicans, the fruit fly Drosophila melanogaster, and the human. These methods perform competitively with the best existing de novo gene identification systems, and are practically applicable to the goal of improving existing gene annotations through comparative genomics. by Michael F. Lin. M.Eng. 2007-03-12T17:55:35Z 2007-03-12T17:55:35Z 2006 2006 Thesis http://hdl.handle.net/1721.1/36807 80777803 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 56 leaves application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Lin, Michael F. (Michael Fong-Jay)
Comparative gene identification in mammalian, fly, and fungal genomes
title Comparative gene identification in mammalian, fly, and fungal genomes
title_full Comparative gene identification in mammalian, fly, and fungal genomes
title_fullStr Comparative gene identification in mammalian, fly, and fungal genomes
title_full_unstemmed Comparative gene identification in mammalian, fly, and fungal genomes
title_short Comparative gene identification in mammalian, fly, and fungal genomes
title_sort comparative gene identification in mammalian fly and fungal genomes
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/36807
work_keys_str_mv AT linmichaelfmichaelfongjay comparativegeneidentificationinmammalianflyandfungalgenomes