Predicting <i>Salmonella</i> MIC and Deciphering Genomic Determinants of Antibiotic Resistance and Susceptibility

<i>Salmonella</i> spp., a leading cause of foodborne illness, is a formidable global menace due to escalating antimicrobial resistance (AMR). The evaluation of minimum inhibitory concentration (MIC) for antimicrobials is critical for characterizing AMR. The current whole genome sequencin...

Full description

Bibliographic Details
Main Authors: Moses B. Ayoola, Athish Ram Das, B. Santhana Krishnan, David R. Smith, Bindu Nanduri, Mahalingam Ramkumar
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Microorganisms
Subjects:
Online Access:https://www.mdpi.com/2076-2607/12/1/134
Description
Summary:<i>Salmonella</i> spp., a leading cause of foodborne illness, is a formidable global menace due to escalating antimicrobial resistance (AMR). The evaluation of minimum inhibitory concentration (MIC) for antimicrobials is critical for characterizing AMR. The current whole genome sequencing (WGS)-based approaches for predicting MIC are hindered by both computational and feature identification constraints. We propose an innovative methodology called the “Genome Feature Extractor Pipeline” that integrates traditional machine learning (random forest, RF) with deep learning models (multilayer perceptron (MLP) and DeepLift) for WGS-based MIC prediction. We used a dataset from the National Antimicrobial Resistance Monitoring System (NARMS), comprising 4500 assembled genomes of nontyphoidal <i>Salmonella</i>, each annotated with MIC metadata for 15 antibiotics. Our pipeline involves the batch downloading of annotated genomes, the determination of feature importance using RF, Gini-index-based selection of crucial 10-mers, and their expansion to 20-mers. This is followed by an MLP network, with four hidden layers of 1024 neurons each, to predict MIC values. Using DeepLift, key 20-mers and associated genes influencing MIC are identified. The 10 most significant 20-mers for each antibiotic are listed, showcasing our ability to discern genomic features affecting <i>Salmonella</i> MIC prediction with enhanced precision. The methodology replaces binary indicators with k-mer counts, offering a more nuanced analysis. The combination of RF and MLP addresses the limitations of the existing WGS approach, providing a robust and efficient method for predicting MIC values in <i>Salmonella</i> that could potentially be applied to other pathogens.
ISSN:2076-2607