A graph-theoretic approach for inparalog detection

<p>Abstract</p> <p>Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this histo...

Full description

Bibliographic Details
Main Authors: Tremblay-Savard Olivier, Swenson Krister M
Format: Article
Language:English
Published: BMC 2012-12-01
Series:BMC Bioinformatics
_version_ 1811318332459057152
author Tremblay-Savard Olivier
Swenson Krister M
author_facet Tremblay-Savard Olivier
Swenson Krister M
author_sort Tremblay-Savard Olivier
collection DOAJ
description <p>Abstract</p> <p>Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this history; relationships between genes such as orthology (genes related through a speciation event) or paralogy (genes related through a duplication event) are usually correlated with these features. For example, recent work has shown that in human and mouse there is a strong connection between function and inparalogs, the paralogs that were created since the speciation event separating the human and mouse lineages. Methods exist for detecting inparalogs that either use information from only two species, or consider a set of species but rely on clustering methods. In this paper we present a graph-theoretic approach for finding lower bounds on the number of inparalogs for a given set of species; we pose an edge covering problem on the similarity graph and give an efficient 2/3-approximation as well as a faster heuristic. Since the physical position of inparalogs corresponding to recent speciations is not likely to have changed since the duplication, we also use our predictions to estimate the types of duplications that have occurred in some vertebrates and drosophila.</p>
first_indexed 2024-04-13T12:23:46Z
format Article
id doaj.art-dd94a491a5394dd29d5ed738ff771a93
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-13T12:23:46Z
publishDate 2012-12-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-dd94a491a5394dd29d5ed738ff771a932022-12-22T02:47:05ZengBMCBMC Bioinformatics1471-21052012-12-0113Suppl 19S1610.1186/1471-2105-13-S19-S16A graph-theoretic approach for inparalog detectionTremblay-Savard OlivierSwenson Krister M<p>Abstract</p> <p>Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this history; relationships between genes such as orthology (genes related through a speciation event) or paralogy (genes related through a duplication event) are usually correlated with these features. For example, recent work has shown that in human and mouse there is a strong connection between function and inparalogs, the paralogs that were created since the speciation event separating the human and mouse lineages. Methods exist for detecting inparalogs that either use information from only two species, or consider a set of species but rely on clustering methods. In this paper we present a graph-theoretic approach for finding lower bounds on the number of inparalogs for a given set of species; we pose an edge covering problem on the similarity graph and give an efficient 2/3-approximation as well as a faster heuristic. Since the physical position of inparalogs corresponding to recent speciations is not likely to have changed since the duplication, we also use our predictions to estimate the types of duplications that have occurred in some vertebrates and drosophila.</p>
spellingShingle Tremblay-Savard Olivier
Swenson Krister M
A graph-theoretic approach for inparalog detection
BMC Bioinformatics
title A graph-theoretic approach for inparalog detection
title_full A graph-theoretic approach for inparalog detection
title_fullStr A graph-theoretic approach for inparalog detection
title_full_unstemmed A graph-theoretic approach for inparalog detection
title_short A graph-theoretic approach for inparalog detection
title_sort graph theoretic approach for inparalog detection
work_keys_str_mv AT tremblaysavardolivier agraphtheoreticapproachforinparalogdetection
AT swensonkristerm agraphtheoreticapproachforinparalogdetection
AT tremblaysavardolivier graphtheoreticapproachforinparalogdetection
AT swensonkristerm graphtheoreticapproachforinparalogdetection