On calculating the probability of a set of orthologous sequences
Junfeng Liu1,2, Liang Chen3, Hongyu Zhao4, Dirk F Moore1,2, Yong Lin1,2, Weichung Joe Shih1,21Biometrics Division, The Cancer, Institute of New Jersey, New Brunswick, NJ, USA; 2Department of Biostatistics, School of Public Health, University of Medicine and Dentistry of New Jersey, Piscataway, NJ, U...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Dove Medical Press
2009-02-01
|
Series: | Advances and Applications in Bioinformatics and Chemistry |
Online Access: | http://www.dovepress.com/on-calculating-the-probability-of-a-set-of-orthologous-sequences-a2885 |
_version_ | 1811300625155096576 |
---|---|
author | Junfeng Liu Liang Chen Hongyu Zhao Dirk F Moore Yong Lin Weichung Joe Shih |
author_facet | Junfeng Liu Liang Chen Hongyu Zhao Dirk F Moore Yong Lin Weichung Joe Shih |
author_sort | Junfeng Liu |
collection | DOAJ |
description | Junfeng Liu1,2, Liang Chen3, Hongyu Zhao4, Dirk F Moore1,2, Yong Lin1,2, Weichung Joe Shih1,21Biometrics Division, The Cancer, Institute of New Jersey, New Brunswick, NJ, USA; 2Department of Biostatistics, School of Public Health, University of Medicine and Dentistry of New Jersey, Piscataway, NJ, USA; 3Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA; 4Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT, USAAbstract: Probabilistic DNA sequence models have been intensively applied to genome research. Within the evolutionary biology framework, this article investigates the feasibility for rigorously estimating the probability of a set of orthologous DNA sequences which evolve from a common progenitor. We propose Monte Carlo integration algorithms to sample the unknown ancestral and/or root sequences a posteriori conditional on a reference sequence and apply pairwise Needleman–Wunsch alignment between the sampled and nonreference species sequences to estimate the probability. We test our algorithms on both simulated and real sequences and compare calculated probabilities from Monte Carlo integration to those induced by single multiple alignment.Keywords: evolution, Jukes–Cantor model, Monte Carlo integration, Needleman–Wunsch alignment, orthologous |
first_indexed | 2024-04-13T06:54:08Z |
format | Article |
id | doaj.art-8181a676808f44a6989244df295a4fe0 |
institution | Directory Open Access Journal |
issn | 1178-6949 |
language | English |
last_indexed | 2024-04-13T06:54:08Z |
publishDate | 2009-02-01 |
publisher | Dove Medical Press |
record_format | Article |
series | Advances and Applications in Bioinformatics and Chemistry |
spelling | doaj.art-8181a676808f44a6989244df295a4fe02022-12-22T02:57:18ZengDove Medical PressAdvances and Applications in Bioinformatics and Chemistry1178-69492009-02-012009default3748On calculating the probability of a set of orthologous sequencesJunfeng LiuLiang ChenHongyu ZhaoDirk F MooreYong LinWeichung Joe ShihJunfeng Liu1,2, Liang Chen3, Hongyu Zhao4, Dirk F Moore1,2, Yong Lin1,2, Weichung Joe Shih1,21Biometrics Division, The Cancer, Institute of New Jersey, New Brunswick, NJ, USA; 2Department of Biostatistics, School of Public Health, University of Medicine and Dentistry of New Jersey, Piscataway, NJ, USA; 3Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA; 4Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT, USAAbstract: Probabilistic DNA sequence models have been intensively applied to genome research. Within the evolutionary biology framework, this article investigates the feasibility for rigorously estimating the probability of a set of orthologous DNA sequences which evolve from a common progenitor. We propose Monte Carlo integration algorithms to sample the unknown ancestral and/or root sequences a posteriori conditional on a reference sequence and apply pairwise Needleman–Wunsch alignment between the sampled and nonreference species sequences to estimate the probability. We test our algorithms on both simulated and real sequences and compare calculated probabilities from Monte Carlo integration to those induced by single multiple alignment.Keywords: evolution, Jukes–Cantor model, Monte Carlo integration, Needleman–Wunsch alignment, orthologoushttp://www.dovepress.com/on-calculating-the-probability-of-a-set-of-orthologous-sequences-a2885 |
spellingShingle | Junfeng Liu Liang Chen Hongyu Zhao Dirk F Moore Yong Lin Weichung Joe Shih On calculating the probability of a set of orthologous sequences Advances and Applications in Bioinformatics and Chemistry |
title | On calculating the probability of a set of orthologous sequences |
title_full | On calculating the probability of a set of orthologous sequences |
title_fullStr | On calculating the probability of a set of orthologous sequences |
title_full_unstemmed | On calculating the probability of a set of orthologous sequences |
title_short | On calculating the probability of a set of orthologous sequences |
title_sort | on calculating the probability of a set of orthologous sequences |
url | http://www.dovepress.com/on-calculating-the-probability-of-a-set-of-orthologous-sequences-a2885 |
work_keys_str_mv | AT junfengliu oncalculatingtheprobabilityofasetoforthologoussequences AT liangchen oncalculatingtheprobabilityofasetoforthologoussequences AT hongyuzhao oncalculatingtheprobabilityofasetoforthologoussequences AT dirkfmoore oncalculatingtheprobabilityofasetoforthologoussequences AT yonglin oncalculatingtheprobabilityofasetoforthologoussequences AT weichungjoeshih oncalculatingtheprobabilityofasetoforthologoussequences |