HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
Abstract Background The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mi...
Main Authors: | , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-02-01
|
Series: | Microbiome |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40168-021-01002-3 |
_version_ | 1818419251866763264 |
---|---|
author | Yu Li Zeling Xu Wenkai Han Huiluo Cao Ramzan Umarov Aixin Yan Ming Fan Huan Chen Carlos M. Duarte Lihua Li Pak-Leung Ho Xin Gao |
author_facet | Yu Li Zeling Xu Wenkai Han Huiluo Cao Ramzan Umarov Aixin Yan Ming Fan Huan Chen Carlos M. Duarte Lihua Li Pak-Leung Ho Xin Gao |
author_sort | Yu Li |
collection | DOAJ |
description | Abstract Background The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. Results Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. Conclusions We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/ . Video abstract (MP4 50984 kb) |
first_indexed | 2024-12-14T12:35:36Z |
format | Article |
id | doaj.art-6a62358d67c749c99382697d2a9a8015 |
institution | Directory Open Access Journal |
issn | 2049-2618 |
language | English |
last_indexed | 2024-12-14T12:35:36Z |
publishDate | 2021-02-01 |
publisher | BMC |
record_format | Article |
series | Microbiome |
spelling | doaj.art-6a62358d67c749c99382697d2a9a80152022-12-21T23:01:03ZengBMCMicrobiome2049-26182021-02-019111210.1186/s40168-021-01002-3HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genesYu Li0Zeling Xu1Wenkai Han2Huiluo Cao3Ramzan Umarov4Aixin Yan5Ming Fan6Huan Chen7Carlos M. Duarte8Lihua Li9Pak-Leung Ho10Xin Gao11Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)School of Biological Sciences, The University of Hong KongComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)Carol Yu Center for Infection and Department of Microbiology, The University of Hong KongComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)School of Biological Sciences, The University of Hong KongInstitute of Biomedical Engineering and Instrumentation, Hangzhou Dianzi UniversityKey Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province, Zhejiang Institute of MicrobiologyComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)Institute of Biomedical Engineering and Instrumentation, Hangzhou Dianzi UniversityCarol Yu Center for Infection and Department of Microbiology, The University of Hong KongComputational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)Abstract Background The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. Results Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. Conclusions We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/ . Video abstract (MP4 50984 kb)https://doi.org/10.1186/s40168-021-01002-3Antibiotic resistance genesDeep learningAntibiotic classResistant mechanismGene mobilityMulti-task learning |
spellingShingle | Yu Li Zeling Xu Wenkai Han Huiluo Cao Ramzan Umarov Aixin Yan Ming Fan Huan Chen Carlos M. Duarte Lihua Li Pak-Leung Ho Xin Gao HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes Microbiome Antibiotic resistance genes Deep learning Antibiotic class Resistant mechanism Gene mobility Multi-task learning |
title | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_full | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_fullStr | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_full_unstemmed | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_short | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_sort | hmd arg hierarchical multi task deep learning for annotating antibiotic resistance genes |
topic | Antibiotic resistance genes Deep learning Antibiotic class Resistant mechanism Gene mobility Multi-task learning |
url | https://doi.org/10.1186/s40168-021-01002-3 |
work_keys_str_mv | AT yuli hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT zelingxu hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT wenkaihan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT huiluocao hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT ramzanumarov hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT aixinyan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT mingfan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT huanchen hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT carlosmduarte hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT lihuali hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT pakleungho hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT xingao hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes |