HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes

Abstract Background The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mi...

Full description

Bibliographic Details
Main Authors: Yu Li, Zeling Xu, Wenkai Han, Huiluo Cao, Ramzan Umarov, Aixin Yan, Ming Fan, Huan Chen, Carlos M. Duarte, Lihua Li, Pak-Leung Ho, Xin Gao
Format: Article
Language:English
Published: BMC 2021-02-01
Series:Microbiome
Subjects:
Online Access:https://doi.org/10.1186/s40168-021-01002-3
_version_ 1818419251866763264
author Yu Li
Zeling Xu
Wenkai Han
Huiluo Cao
Ramzan Umarov
Aixin Yan
Ming Fan
Huan Chen
Carlos M. Duarte
Lihua Li
Pak-Leung Ho
Xin Gao
author_facet Yu Li
Zeling Xu
Wenkai Han
Huiluo Cao
Ramzan Umarov
Aixin Yan
Ming Fan
Huan Chen
Carlos M. Duarte
Lihua Li
Pak-Leung Ho
Xin Gao
author_sort Yu Li
collection DOAJ
description Abstract Background The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. Results Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. Conclusions We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/ . Video abstract (MP4 50984 kb)
first_indexed 2024-12-14T12:35:36Z
format Article
id doaj.art-6a62358d67c749c99382697d2a9a8015
institution Directory Open Access Journal
issn 2049-2618
language English
last_indexed 2024-12-14T12:35:36Z
publishDate 2021-02-01
publisher BMC
record_format Article
series Microbiome
spelling doaj.art-6a62358d67c749c99382697d2a9a80152022-12-21T23:01:03ZengBMCMicrobiome2049-26182021-02-019111210.1186/s40168-021-01002-3HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genesYu Li0Zeling Xu1Wenkai Han2Huiluo Cao3Ramzan Umarov4Aixin Yan5Ming Fan6Huan Chen7Carlos M. Duarte8Lihua Li9Pak-Leung Ho10Xin Gao11Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)School of Biological Sciences, The University of Hong KongComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)Carol Yu Center for Infection and Department of Microbiology, The University of Hong KongComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)School of Biological Sciences, The University of Hong KongInstitute of Biomedical Engineering and Instrumentation, Hangzhou Dianzi UniversityKey Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province, Zhejiang Institute of MicrobiologyComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)Institute of Biomedical Engineering and Instrumentation, Hangzhou Dianzi UniversityCarol Yu Center for Infection and Department of Microbiology, The University of Hong KongComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)Abstract Background The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. Results Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. Conclusions We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/ . Video abstract (MP4 50984 kb)https://doi.org/10.1186/s40168-021-01002-3Antibiotic resistance genesDeep learningAntibiotic classResistant mechanismGene mobilityMulti-task learning
spellingShingle Yu Li
Zeling Xu
Wenkai Han
Huiluo Cao
Ramzan Umarov
Aixin Yan
Ming Fan
Huan Chen
Carlos M. Duarte
Lihua Li
Pak-Leung Ho
Xin Gao
HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
Microbiome
Antibiotic resistance genes
Deep learning
Antibiotic class
Resistant mechanism
Gene mobility
Multi-task learning
title HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_full HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_fullStr HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_full_unstemmed HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_short HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_sort hmd arg hierarchical multi task deep learning for annotating antibiotic resistance genes
topic Antibiotic resistance genes
Deep learning
Antibiotic class
Resistant mechanism
Gene mobility
Multi-task learning
url https://doi.org/10.1186/s40168-021-01002-3
work_keys_str_mv AT yuli hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT zelingxu hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT wenkaihan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT huiluocao hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT ramzanumarov hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT aixinyan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT mingfan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT huanchen hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT carlosmduarte hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT lihuali hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT pakleungho hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT xingao hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes