EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction

The identification of drug-target interaction (DTI) plays a key role in drug discovery and development. Benefitting from large-scale drug databases and verified DTI relationships, a lot of machine-learning methods have been developed to predict DTIs. However, due to the difficulty in extracting usef...

Full description

Bibliographic Details
Main Authors: Yuan Jin, Jiarui Lu, Runhan Shi, Yang Yang
Format: Article
Language:English
Published: MDPI AG 2021-11-01
Series:Biomolecules
Subjects:
Online Access:https://www.mdpi.com/2218-273X/11/12/1783
_version_ 1827673927378272256
author Yuan Jin
Jiarui Lu
Runhan Shi
Yang Yang
author_facet Yuan Jin
Jiarui Lu
Runhan Shi
Yang Yang
author_sort Yuan Jin
collection DOAJ
description The identification of drug-target interaction (DTI) plays a key role in drug discovery and development. Benefitting from large-scale drug databases and verified DTI relationships, a lot of machine-learning methods have been developed to predict DTIs. However, due to the difficulty in extracting useful information from molecules, the performance of these methods is limited by the representation of drugs and target proteins. This study proposes a new model called EmbedDTI to enhance the representation of both drugs and target proteins, and improve the performance of DTI prediction. For protein sequences, we leverage language modeling for pretraining the feature embeddings of amino acids and feed them to a convolutional neural network model for further representation learning. For drugs, we build two levels of graphs to represent compound structural information, namely the atom graph and substructure graph, and adopt graph convolutional network with an attention module to learn the embedding vectors for the graphs. We compare EmbedDTI with the existing DTI predictors on two benchmark datasets. The experimental results show that EmbedDTI outperforms the state-of-the-art models, and the attention module can identify the components crucial for DTIs in compounds.
first_indexed 2024-03-10T04:32:34Z
format Article
id doaj.art-32e786e7a336434999ea7c3af1037345
institution Directory Open Access Journal
issn 2218-273X
language English
last_indexed 2024-03-10T04:32:34Z
publishDate 2021-11-01
publisher MDPI AG
record_format Article
series Biomolecules
spelling doaj.art-32e786e7a336434999ea7c3af10373452023-11-23T03:59:04ZengMDPI AGBiomolecules2218-273X2021-11-011112178310.3390/biom11121783EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target InteractionYuan Jin0Jiarui Lu1Runhan Shi2Yang Yang3Center for Brain-Like Computing and Machine Intelligence, Department of Computer Science and Engineering, Shanghai Jiao Tong University, 800 Dong Chuan Rd., Shanghai 200240, ChinaSchool of Chemistry and Chemical Engineering, Shanghai Jiao Tong University, 800 Dong Chuan Rd., Shanghai 200240, ChinaCenter for Brain-Like Computing and Machine Intelligence, Department of Computer Science and Engineering, Shanghai Jiao Tong University, 800 Dong Chuan Rd., Shanghai 200240, ChinaCenter for Brain-Like Computing and Machine Intelligence, Department of Computer Science and Engineering, Shanghai Jiao Tong University, 800 Dong Chuan Rd., Shanghai 200240, ChinaThe identification of drug-target interaction (DTI) plays a key role in drug discovery and development. Benefitting from large-scale drug databases and verified DTI relationships, a lot of machine-learning methods have been developed to predict DTIs. However, due to the difficulty in extracting useful information from molecules, the performance of these methods is limited by the representation of drugs and target proteins. This study proposes a new model called EmbedDTI to enhance the representation of both drugs and target proteins, and improve the performance of DTI prediction. For protein sequences, we leverage language modeling for pretraining the feature embeddings of amino acids and feed them to a convolutional neural network model for further representation learning. For drugs, we build two levels of graphs to represent compound structural information, namely the atom graph and substructure graph, and adopt graph convolutional network with an attention module to learn the embedding vectors for the graphs. We compare EmbedDTI with the existing DTI predictors on two benchmark datasets. The experimental results show that EmbedDTI outperforms the state-of-the-art models, and the attention module can identify the components crucial for DTIs in compounds.https://www.mdpi.com/2218-273X/11/12/1783drug-target interactiongraph convolutional networkmolecular representation
spellingShingle Yuan Jin
Jiarui Lu
Runhan Shi
Yang Yang
EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
Biomolecules
drug-target interaction
graph convolutional network
molecular representation
title EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_full EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_fullStr EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_full_unstemmed EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_short EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_sort embeddti enhancing the molecular representations via sequence embedding and graph convolutional network for the prediction of drug target interaction
topic drug-target interaction
graph convolutional network
molecular representation
url https://www.mdpi.com/2218-273X/11/12/1783
work_keys_str_mv AT yuanjin embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction
AT jiaruilu embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction
AT runhanshi embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction
AT yangyang embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction