pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science

Abstract Background Biomolecular interactions that modulate biological processes occur mainly in cavities throughout the surface of biomolecular structures. In the data science era, structural biology has benefited from the increasing availability of biostructural data due to advances in structural...

Full description

Bibliographic Details
Main Authors: João Victor da Silva Guerra, Helder Veras Ribeiro-Filho, Gabriel Ernesto Jara, Leandro Oliveira Bortot, José Geraldo de Carvalho Pereira, Paulo Sérgio Lopes-de-Oliveira
Format: Article
Language:English
Published: BMC 2021-12-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04519-4
_version_ 1819102971457175552
author João Victor da Silva Guerra
Helder Veras Ribeiro-Filho
Gabriel Ernesto Jara
Leandro Oliveira Bortot
José Geraldo de Carvalho Pereira
Paulo Sérgio Lopes-de-Oliveira
author_facet João Victor da Silva Guerra
Helder Veras Ribeiro-Filho
Gabriel Ernesto Jara
Leandro Oliveira Bortot
José Geraldo de Carvalho Pereira
Paulo Sérgio Lopes-de-Oliveira
author_sort João Victor da Silva Guerra
collection DOAJ
description Abstract Background Biomolecular interactions that modulate biological processes occur mainly in cavities throughout the surface of biomolecular structures. In the data science era, structural biology has benefited from the increasing availability of biostructural data due to advances in structural determination and computational methods. In this scenario, data-intensive cavity analysis demands efficient scripting routines built on easily manipulated data structures. To fulfill this need, we developed pyKVFinder, a Python package to detect and characterize cavities in biomolecular structures for data science and automated pipelines. Results pyKVFinder efficiently detects cavities in biomolecular structures and computes their volume, area, depth and hydropathy, storing these cavity properties in NumPy arrays. Benefited from Python ecosystem interoperability and data structures, pyKVFinder can be integrated with third-party scientific packages and libraries for mathematical calculations, machine learning and 3D visualization in automated workflows. As proof of pyKVFinder’s capabilities, we successfully identified and compared ADRP substrate-binding site of SARS-CoV-2 and a set of homologous proteins with pyKVFinder, showing its integrability with data science packages such as matplotlib, NGL Viewer, SciPy and Jupyter notebook. Conclusions We introduce an efficient, highly versatile and easily integrable software for detecting and characterizing biomolecular cavities in data science applications and automated protocols. pyKVFinder facilitates biostructural data analysis with scripting routines in the Python ecosystem and can be building blocks for data science and drug design applications.
first_indexed 2024-12-22T01:43:02Z
format Article
id doaj.art-8ed22d93639c46f7a27d48425ed79d7f
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-22T01:43:02Z
publishDate 2021-12-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-8ed22d93639c46f7a27d48425ed79d7f2022-12-21T18:43:09ZengBMCBMC Bioinformatics1471-21052021-12-0122111310.1186/s12859-021-04519-4pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data scienceJoão Victor da Silva Guerra0Helder Veras Ribeiro-Filho1Gabriel Ernesto Jara2Leandro Oliveira Bortot3José Geraldo de Carvalho Pereira4Paulo Sérgio Lopes-de-Oliveira5Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio)Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio)Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio)Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio)Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio)Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio)Abstract Background Biomolecular interactions that modulate biological processes occur mainly in cavities throughout the surface of biomolecular structures. In the data science era, structural biology has benefited from the increasing availability of biostructural data due to advances in structural determination and computational methods. In this scenario, data-intensive cavity analysis demands efficient scripting routines built on easily manipulated data structures. To fulfill this need, we developed pyKVFinder, a Python package to detect and characterize cavities in biomolecular structures for data science and automated pipelines. Results pyKVFinder efficiently detects cavities in biomolecular structures and computes their volume, area, depth and hydropathy, storing these cavity properties in NumPy arrays. Benefited from Python ecosystem interoperability and data structures, pyKVFinder can be integrated with third-party scientific packages and libraries for mathematical calculations, machine learning and 3D visualization in automated workflows. As proof of pyKVFinder’s capabilities, we successfully identified and compared ADRP substrate-binding site of SARS-CoV-2 and a set of homologous proteins with pyKVFinder, showing its integrability with data science packages such as matplotlib, NGL Viewer, SciPy and Jupyter notebook. Conclusions We introduce an efficient, highly versatile and easily integrable software for detecting and characterizing biomolecular cavities in data science applications and automated protocols. pyKVFinder facilitates biostructural data analysis with scripting routines in the Python ecosystem and can be building blocks for data science and drug design applications.https://doi.org/10.1186/s12859-021-04519-4Cavity detectionCavity characterizationNumPyPythonData structureData science
spellingShingle João Victor da Silva Guerra
Helder Veras Ribeiro-Filho
Gabriel Ernesto Jara
Leandro Oliveira Bortot
José Geraldo de Carvalho Pereira
Paulo Sérgio Lopes-de-Oliveira
pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science
BMC Bioinformatics
Cavity detection
Cavity characterization
NumPy
Python
Data structure
Data science
title pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science
title_full pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science
title_fullStr pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science
title_full_unstemmed pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science
title_short pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science
title_sort pykvfinder an efficient and integrable python package for biomolecular cavity detection and characterization in data science
topic Cavity detection
Cavity characterization
NumPy
Python
Data structure
Data science
url https://doi.org/10.1186/s12859-021-04519-4
work_keys_str_mv AT joaovictordasilvaguerra pykvfinderanefficientandintegrablepythonpackageforbiomolecularcavitydetectionandcharacterizationindatascience
AT helderverasribeirofilho pykvfinderanefficientandintegrablepythonpackageforbiomolecularcavitydetectionandcharacterizationindatascience
AT gabrielernestojara pykvfinderanefficientandintegrablepythonpackageforbiomolecularcavitydetectionandcharacterizationindatascience
AT leandrooliveirabortot pykvfinderanefficientandintegrablepythonpackageforbiomolecularcavitydetectionandcharacterizationindatascience
AT josegeraldodecarvalhopereira pykvfinderanefficientandintegrablepythonpackageforbiomolecularcavitydetectionandcharacterizationindatascience
AT paulosergiolopesdeoliveira pykvfinderanefficientandintegrablepythonpackageforbiomolecularcavitydetectionandcharacterizationindatascience