AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders

Abstract Background Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critica...

Full description

Bibliographic Details
Main Authors: Incheol Shin, Keumseok Kang, Juseong Kim, Sanghun Sel, Jeonghoon Choi, Jae-Wook Lee, Ho Young Kang, Giltae Song
Format: Article
Language:English
Published: BMC 2023-11-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-023-05577-6
_version_ 1797413795056844800
author Incheol Shin
Keumseok Kang
Juseong Kim
Sanghun Sel
Jeonghoon Choi
Jae-Wook Lee
Ho Young Kang
Giltae Song
author_facet Incheol Shin
Keumseok Kang
Juseong Kim
Sanghun Sel
Jeonghoon Choi
Jae-Wook Lee
Ho Young Kang
Giltae Song
author_sort Incheol Shin
collection DOAJ
description Abstract Background Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vitro technique employed to identify aptamers that bind specifically to target proteins. While advanced SELEX-based methods such as Cell- and HT-SELEX are available, they often encounter issues such as extended time consumption and suboptimal accuracy. Several In silico aptamer discovery methods have been proposed to address these challenges. These methods are specifically designed to predict aptamer-protein interaction (API) using benchmark datasets. However, these methods often fail to consider the physicochemical interactions between aptamers and proteins within tertiary structures. Results In this study, we propose AptaTrans, a pipeline for predicting API using deep learning techniques. AptaTrans uses transformer-based encoders to handle aptamer and protein sequences at the monomer level. Furthermore, pretrained encoders are utilized for the structural representation. After validation with a benchmark dataset, AptaTrans has been integrated into a comprehensive toolset. This pipeline synergistically combines with Apta-MCTS, a generative algorithm for recommending aptamer candidates. Conclusion The results show that AptaTrans outperforms existing models for predicting API, and the efficacy of the AptaTrans pipeline has been confirmed through various experimental tools. We expect AptaTrans will enhance the cost-effectiveness and efficiency of SELEX in drug discovery. The source code and benchmark dataset for AptaTrans are available at https://github.com/pnumlb/AptaTrans .
first_indexed 2024-03-09T05:24:02Z
format Article
id doaj.art-2a42a0ea0aa746b1b187a360af037ae2
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-03-09T05:24:02Z
publishDate 2023-11-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-2a42a0ea0aa746b1b187a360af037ae22023-12-03T12:38:17ZengBMCBMC Bioinformatics1471-21052023-11-0124112010.1186/s12859-023-05577-6AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encodersIncheol Shin0Keumseok Kang1Juseong Kim2Sanghun Sel3Jeonghoon Choi4Jae-Wook Lee5Ho Young Kang6Giltae Song7Division of Artificial Intelligence, Pusan National UniversityDivision of Artificial Intelligence, Pusan National UniversityDivision of Artificial Intelligence, Pusan National UniversityDivision of Artificial Intelligence, Pusan National UniversityDivision of Artificial Intelligence, Pusan National UniversityResearch & DevelopmentResearch & DevelopmentDivision of Artificial Intelligence, Pusan National UniversityAbstract Background Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vitro technique employed to identify aptamers that bind specifically to target proteins. While advanced SELEX-based methods such as Cell- and HT-SELEX are available, they often encounter issues such as extended time consumption and suboptimal accuracy. Several In silico aptamer discovery methods have been proposed to address these challenges. These methods are specifically designed to predict aptamer-protein interaction (API) using benchmark datasets. However, these methods often fail to consider the physicochemical interactions between aptamers and proteins within tertiary structures. Results In this study, we propose AptaTrans, a pipeline for predicting API using deep learning techniques. AptaTrans uses transformer-based encoders to handle aptamer and protein sequences at the monomer level. Furthermore, pretrained encoders are utilized for the structural representation. After validation with a benchmark dataset, AptaTrans has been integrated into a comprehensive toolset. This pipeline synergistically combines with Apta-MCTS, a generative algorithm for recommending aptamer candidates. Conclusion The results show that AptaTrans outperforms existing models for predicting API, and the efficacy of the AptaTrans pipeline has been confirmed through various experimental tools. We expect AptaTrans will enhance the cost-effectiveness and efficiency of SELEX in drug discovery. The source code and benchmark dataset for AptaTrans are available at https://github.com/pnumlb/AptaTrans .https://doi.org/10.1186/s12859-023-05577-6Aptamer protein interactionTransformerPretraingStructural representationSELEX
spellingShingle Incheol Shin
Keumseok Kang
Juseong Kim
Sanghun Sel
Jeonghoon Choi
Jae-Wook Lee
Ho Young Kang
Giltae Song
AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
BMC Bioinformatics
Aptamer protein interaction
Transformer
Pretraing
Structural representation
SELEX
title AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_full AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_fullStr AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_full_unstemmed AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_short AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_sort aptatrans a deep neural network for predicting aptamer protein interaction using pretrained encoders
topic Aptamer protein interaction
Transformer
Pretraing
Structural representation
SELEX
url https://doi.org/10.1186/s12859-023-05577-6
work_keys_str_mv AT incheolshin aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT keumseokkang aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT juseongkim aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT sanghunsel aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT jeonghoonchoi aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT jaewooklee aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT hoyoungkang aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT giltaesong aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders