In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide Substrates

The SARS-CoV-2 main protease (M<sup>pro</sup>) is essential for the life cycle of the COVID-19 virus. It cleaves the two polyproteins at 11 positions to generate mature proteins for virion formation. The cleavage site on these polyproteins is known to be Leu-Gln↓(Ser/Ala/Gly). A range of...

Full description

Bibliographic Details
Main Authors: Sophakama Zabo, Kevin Alan Lobb
Format: Article
Language:English
Published: MDPI AG 2023-06-01
Series:Viruses
Subjects:
Online Access:https://www.mdpi.com/1999-4915/15/7/1480
_version_ 1797587273049440256
author Sophakama Zabo
Kevin Alan Lobb
author_facet Sophakama Zabo
Kevin Alan Lobb
author_sort Sophakama Zabo
collection DOAJ
description The SARS-CoV-2 main protease (M<sup>pro</sup>) is essential for the life cycle of the COVID-19 virus. It cleaves the two polyproteins at 11 positions to generate mature proteins for virion formation. The cleavage site on these polyproteins is known to be Leu-Gln↓(Ser/Ala/Gly). A range of hexapeptides that follow the known sequence for recognition and cleavage was constructed using RDKit libraries and complexed with the crystal structure of M<sup>pro</sup> (PDB ID 6XHM) through extensive molecular docking calculations. A subset of 131 of these complexes underwent 20 ns molecular dynamics simulations. The analyses of the trajectories from molecular dynamics included principal component analysis (PCA), and a method to compare PCA plots from separate trajectories was developed in terms of encoding PCA progression during the simulations. The hexapeptides formed stable complexes as expected, with reproducible molecular docking of the substrates given the extensiveness of the procedure. Only Lys-Leu-Gln*** (KLQ***) sequence complexes were studied for molecular dynamics. In this subset of complexes, the PCA analysis identified four classifications of protein motions across these sequences. KLQ*** complexes illustrated the effect of changes in substrate on the active site, with implications for understanding the substrate recognition of M<sup>pro</sup> and informing the development of small molecule inhibitors.
first_indexed 2024-03-11T00:34:47Z
format Article
id doaj.art-fbb132e1689f41619ad75e06e7342619
institution Directory Open Access Journal
issn 1999-4915
language English
last_indexed 2024-03-11T00:34:47Z
publishDate 2023-06-01
publisher MDPI AG
record_format Article
series Viruses
spelling doaj.art-fbb132e1689f41619ad75e06e73426192023-11-18T21:44:17ZengMDPI AGViruses1999-49152023-06-01157148010.3390/v15071480In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide SubstratesSophakama Zabo0Kevin Alan Lobb1Department of Chemistry, Rhodes University, Makhanda 6139, South AfricaDepartment of Chemistry, Rhodes University, Makhanda 6139, South AfricaThe SARS-CoV-2 main protease (M<sup>pro</sup>) is essential for the life cycle of the COVID-19 virus. It cleaves the two polyproteins at 11 positions to generate mature proteins for virion formation. The cleavage site on these polyproteins is known to be Leu-Gln↓(Ser/Ala/Gly). A range of hexapeptides that follow the known sequence for recognition and cleavage was constructed using RDKit libraries and complexed with the crystal structure of M<sup>pro</sup> (PDB ID 6XHM) through extensive molecular docking calculations. A subset of 131 of these complexes underwent 20 ns molecular dynamics simulations. The analyses of the trajectories from molecular dynamics included principal component analysis (PCA), and a method to compare PCA plots from separate trajectories was developed in terms of encoding PCA progression during the simulations. The hexapeptides formed stable complexes as expected, with reproducible molecular docking of the substrates given the extensiveness of the procedure. Only Lys-Leu-Gln*** (KLQ***) sequence complexes were studied for molecular dynamics. In this subset of complexes, the PCA analysis identified four classifications of protein motions across these sequences. KLQ*** complexes illustrated the effect of changes in substrate on the active site, with implications for understanding the substrate recognition of M<sup>pro</sup> and informing the development of small molecule inhibitors.https://www.mdpi.com/1999-4915/15/7/1480SARS-CoV-2 main proteaseprotein substratemulti-conformer substrate librarymolecular dockingmolecular dynamicsPCA
spellingShingle Sophakama Zabo
Kevin Alan Lobb
In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide Substrates
Viruses
SARS-CoV-2 main protease
protein substrate
multi-conformer substrate library
molecular docking
molecular dynamics
PCA
title In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide Substrates
title_full In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide Substrates
title_fullStr In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide Substrates
title_full_unstemmed In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide Substrates
title_short In Silico Substrate-Binding Profiling for SARS-CoV-2 Main Protease (M<sup>pro</sup>) Using Hexapeptide Substrates
title_sort in silico substrate binding profiling for sars cov 2 main protease m sup pro sup using hexapeptide substrates
topic SARS-CoV-2 main protease
protein substrate
multi-conformer substrate library
molecular docking
molecular dynamics
PCA
url https://www.mdpi.com/1999-4915/15/7/1480
work_keys_str_mv AT sophakamazabo insilicosubstratebindingprofilingforsarscov2mainproteasemsupprosupusinghexapeptidesubstrates
AT kevinalanlobb insilicosubstratebindingprofilingforsarscov2mainproteasemsupprosupusinghexapeptidesubstrates