Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.

This study analyzes the SARS-CoV-2 genome sequence mutations by modeling its nucleotide mutations as a stochastic process in both the time-series and spatial domain of the gene sequence. In the time-series model, a Markov Chain embedded Poisson random process characterizes the mutation rate matrix,...

Full description

Bibliographic Details
Main Authors: Maverick Lim Kai Rong, Ercan Engin Kuruoglu, Wai Kin Victor Chan
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0284874
_version_ 1797828150756900864
author Maverick Lim Kai Rong
Ercan Engin Kuruoglu
Wai Kin Victor Chan
author_facet Maverick Lim Kai Rong
Ercan Engin Kuruoglu
Wai Kin Victor Chan
author_sort Maverick Lim Kai Rong
collection DOAJ
description This study analyzes the SARS-CoV-2 genome sequence mutations by modeling its nucleotide mutations as a stochastic process in both the time-series and spatial domain of the gene sequence. In the time-series model, a Markov Chain embedded Poisson random process characterizes the mutation rate matrix, while the spatial gene sequence model delineates the distribution of mutation inter-occurrence distances. Our experiment focuses on five key variants of concern that had become a global concern due to their high transmissibility and virulence. The time-series results reveal distinct asymmetries in mutation rate and propensities among different nucleotides and across different strains, with a mean mutation rate of approximately 2 mutations per month. In particular, our spatial gene sequence results reveal some novel biological insights on the characteristic distribution of mutation inter-occurrence distances, which display a notable pattern similar to other natural diseases. Our findings contribute interesting insights to the underlying biological mechanism of SARS-CoV-2 mutations, bringing us one step closer to improving the accuracy of existing mutation prediction models. This research could also potentially pave the way for future work in adopting similar spatial random process models and advanced spatial pattern recognition algorithms in order to characterize mutations on other different kinds of virus families.
first_indexed 2024-04-09T12:59:32Z
format Article
id doaj.art-e0076d820f2b4fc2adb81ad01f42e771
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-09T12:59:32Z
publishDate 2023-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-e0076d820f2b4fc2adb81ad01f42e7712023-05-13T05:31:14ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-01184e028487410.1371/journal.pone.0284874Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.Maverick Lim Kai RongErcan Engin KuruogluWai Kin Victor ChanThis study analyzes the SARS-CoV-2 genome sequence mutations by modeling its nucleotide mutations as a stochastic process in both the time-series and spatial domain of the gene sequence. In the time-series model, a Markov Chain embedded Poisson random process characterizes the mutation rate matrix, while the spatial gene sequence model delineates the distribution of mutation inter-occurrence distances. Our experiment focuses on five key variants of concern that had become a global concern due to their high transmissibility and virulence. The time-series results reveal distinct asymmetries in mutation rate and propensities among different nucleotides and across different strains, with a mean mutation rate of approximately 2 mutations per month. In particular, our spatial gene sequence results reveal some novel biological insights on the characteristic distribution of mutation inter-occurrence distances, which display a notable pattern similar to other natural diseases. Our findings contribute interesting insights to the underlying biological mechanism of SARS-CoV-2 mutations, bringing us one step closer to improving the accuracy of existing mutation prediction models. This research could also potentially pave the way for future work in adopting similar spatial random process models and advanced spatial pattern recognition algorithms in order to characterize mutations on other different kinds of virus families.https://doi.org/10.1371/journal.pone.0284874
spellingShingle Maverick Lim Kai Rong
Ercan Engin Kuruoglu
Wai Kin Victor Chan
Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.
PLoS ONE
title Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.
title_full Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.
title_fullStr Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.
title_full_unstemmed Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.
title_short Modeling SARS-CoV-2 nucleotide mutations as a stochastic process.
title_sort modeling sars cov 2 nucleotide mutations as a stochastic process
url https://doi.org/10.1371/journal.pone.0284874
work_keys_str_mv AT mavericklimkairong modelingsarscov2nucleotidemutationsasastochasticprocess
AT ercanenginkuruoglu modelingsarscov2nucleotidemutationsasastochasticprocess
AT waikinvictorchan modelingsarscov2nucleotidemutationsasastochasticprocess